SUBGEOMETRIC ERGODICITY OF STRONG MARKOV PROCESSES

By G. Fort and G. O. Roberts 

CNRS/LMC-IMAG and Lancaster University 

We derive sufficient conditions for subgeometric $f$-ergodicity of
strongly Markovian processes. We first propose a criterion based on
modulated moments of some delayed return-time to a petite set. We
then formulate a criterion for polynomial $f$-ergodicity in terms of a
drift condition on the generator. Applications to specific processes
are considered, including Langevin tempered diffusions on $\mathbb{R}^n$ and
storage models.

1. Introduction. This paper is devoted to the study of subgeometric
$f$-ergodicity of a strong Markov semigroup $(P^t)_{t \ge 0}$. That is, for a
subgeometrically increasing rate function $r := (r(t))_{t \ge 0}$ and a Borel
function $f \ge 1$, we propose sufficient conditions implying the limit

$$\lim_{t \to +\infty} r(t)\, \|P^t(x,\cdot) - \pi(\cdot)\|_f = 0$$

for $\pi$-almost all (a.a.) $x$, where $\pi$ is the unique invariant probability
measure. Our main condition is couched in terms of modulated moments of
return-times to a "test-set." In this form, this condition extends earlier
criteria implying different notions of stability (such as Harris-recurrence,
positive Harris-recurrence, ergodicity and $f$-ergodicity) for continuous-time
Markov processes. This condition is also analogous to the criterion for
subgeometric $f$-ergodicity of discrete-time Markov chains. We also derive a
condition for polynomial ergodicity which is easy to check in many
applications. This condition is expressed in terms of an inequality on the
semigroup generator, and is analogous to the so-called drift inequality in the
discrete-time case.

We apply our results to the study of strongly Markovian processes, giving
three nontrivial examples, two of which are of considerable applied
probabilistic interest. We first consider a simple jump process as a toy
example, demonstrating that $f$-ergodicity at a logarithmic (resp. polynomial
or subexponential) rate is closely related to the existence of a logarithmic
(resp. polynomial or subexponential) moment of the mean time spent in each
state, with respect to the jump distribution. We then consider Langevin
tempered diffusions on $\mathbb{R}^n$, which are relevant to Markov Chain Monte
Carlo (MCMC) techniques since they construct a diffusion process with given
stationary distribution $\pi$ (which only needs to be available up to an unknown
normalization constant). When the stationary distribution is polynomial in
the tails, the (simple) Langevin diffusion cannot be ergodic at a geometric
rate and we show that it is polynomially ergodic. We also consider Langevin
tempered diffusions in which the diffusion matrix is a scalar matrix with
coefficient $\pi^{-2d}$, $d > 0$, and prove that even when the target
distribution is polynomial in the tails, a convenient choice of the
temperature $d$ yields geometric ergodicity of the process. Finally, we study
a compound Poisson-process driven Ornstein-Uhlenbeck process which is used in
storage models and more recently in financial econometrics. It is known that
when the jump distribution $F$ has sufficiently light tails, the process is
geometrically ergodic. We investigate the case where $F$ is heavy tailed and
establish the subgeometric ergodicity of the process under appropriate
conditions in this case.

The paper is organized as follows. We first recall basic definitions on
Markov processes and review existing results on ergodicity of strongly
Markovian processes. The new criteria for subgeometric ergodicity are given
in Section 2, and the proofs are postponed to Section 4. Section 3 is devoted
to the three examples mentioned above.

1.1. Basic definitions on Markov processes. Let $\mathcal{X}$ be a locally compact
and separable metric space endowed with the Borel $\sigma$-field
$\mathcal{B}(\mathcal{X})$. $X = (\Omega, \mathcal{A}, (\mathcal{F}_t)_{t \ge 0}, (X_t)_{t \ge 0}, \mathbb{P}_x)$
is an $\mathcal{X}$-valued Borel right process, so that it is a temporally
homogeneous Markov process, strongly Markovian with right-continuous sample
paths (see, e.g., [31]). $\mathbb{P}_x$ (resp. $\mathbb{E}_x$) denotes the canonical
probability (resp. expectation) associated to the Markov process with initial
distribution $\delta_x$, the Dirac distribution at point $x$. Let
$(P^t)_{t \ge 0}$ be the associated Markov semigroup.

We recall basic definitions and properties of Markov processes that will be
used throughout this paper. The process $X$ is $\phi$-irreducible for some
$\sigma$-finite measure $\phi$ on $\mathcal{B}(\mathcal{X})$ if

$$\phi(A) > 0 \implies \mathbb{E}_x\Big[\int_0^\infty 1_A(X_s)\,ds\Big] > 0 \quad \text{for all } x \in \mathcal{X}.$$

If the process is $\phi$-irreducible, there exists a maximal irreducibility
measure $\psi$ that dominates any irreducibility measure [24]. In fact, if
$\pi$ is an invariant measure, that is, $\pi P^t = \pi$ for all $t \ge 0$, then
$\pi$ is a maximal irreducibility measure. Any measurable set which is of
positive $\psi$-measure is said to be accessible. A set
$C \in \mathcal{B}(\mathcal{X})$ is $\nu_b$-petite for the process (or simply
petite) if there exists a probability measure $b$ (resp. nontrivial
$\sigma$-finite measure $\nu_b$) on the Borel $\sigma$-field of $\mathbb{R}_+$
[resp. on $\mathcal{B}(\mathcal{X})$] such that

$$\int_0^\infty P^t(x,\cdot)\, b(dt) \ge \nu_b(\cdot) \quad \text{for all } x \in C.$$

A $\psi$-irreducible process always possesses an accessible closed petite set
([20], Proposition 3.2). A process is Harris-recurrent if there exists a
$\sigma$-finite measure $\phi$ such that

$$\phi(A) > 0 \implies \mathbb{P}_x\Big(\int_0^\infty 1_A(X_s)\,ds = \infty\Big) = 1 \quad \text{for all } x \in \mathcal{X},$$

or, equivalently, if there exists a $\sigma$-finite measure $\mu$ such that
$\mu(A) > 0 \implies \mathbb{P}_x(\tau_A < \infty) = 1$ for all
$x \in \mathcal{X}$, where $\tau_A$ is the hitting time on $A$. Harris-recurrence
trivially implies $\phi$-irreducibility. A Harris-recurrent right process
possesses an invariant measure [10]; when the invariant measure is finite,
$X$ is called positive Harris-recurrent. A $\psi$-irreducible process is
aperiodic if there exist an accessible $\nu_b$-petite set $C$ and $t_0$ such that
$P^t(x, C) > 0$ for all $x \in C$, $t \ge t_0$. Meyn and Tweedie ([22],
Proposition 6.1) show that a positive Harris-recurrent process is aperiodic
if some skeleton chain $P^m$, $m > 0$, is irreducible, that is, if there exists
a $\sigma$-finite measure $\phi$ on $\mathcal{B}(\mathcal{X})$ such that
$\phi(A) > 0 \implies \mathbb{E}_x[\sum_{n \ge 0} 1_A(X_{nm})] > 0$ for all
$x \in \mathcal{X}$.

For Borel functions $f \ge 1$, $g$, define the norm
$|g|_f := \sup_x |g(x)|/f(x)$ and the Banach space
$\mathcal{L}_f := \{g, |g|_f < \infty\}$. For a signed measure $\mu$, the total
variation norm is given by $\|\mu\|_{TV} := \sup_A \mu(A) - \inf_A \mu(A)$; and
the $f$-norm (for some Borel function $f \ge 1$) by
$\|\mu\|_f := \sup_{\{g, |g|_f = 1\}} |\mu(g)|$, so that the total variation norm
is the 1-norm, where 1 denotes the constant function $1(t) = 1$. The process
is ergodic if

$$\forall x \in \mathcal{X}, \quad \lim_{t \to \infty} \|P^t(x,\cdot) - \pi(\cdot)\|_{TV} = 0,$$

and $f$-ergodic if $\pi(f) < \infty$ and

(1) $\forall x \in \mathcal{X}, \quad \lim_{t \to \infty} \|P^t(x,\cdot) - \pi(\cdot)\|_f = 0.$

Finally, $X$ is geometrically (resp. subgeometrically) $f$-ergodic if the limit
(1) holds at a rate $r(t) := \kappa^t$, for some $\kappa > 1$ [resp.
$r := (r(t))_{t \ge 0}$ for some subgeometrically increasing rate]. A
subgeometric rate is defined as follows (see, e.g., [34]). Let $\Lambda_0$ be
the set of the measurable, bounded on bounded intervals and nondecreasing
functions $r : \mathbb{R}_+ \to [1, \infty)$ such that $\log r(t)/t \downarrow 0$ as
$t \to +\infty$. Let $\Lambda$ be the set of the rates
$\bar r := (\bar r(t))_{t \ge 0}$ such that, for some $r \in \Lambda_0$,
$0 < \liminf_t \bar r(t)/r(t) \le \limsup_t \bar r(t)/r(t) < \infty$. $\Lambda$ is,
by definition, the set of the subgeometric rates. For example, $\Lambda$
contains rates such as $r(t) \sim \log^\beta(t+1)$, $\beta > 0$,
$r(t) \sim (1 \vee t^\alpha) \log^\beta(t+1)$, $\alpha > 0$, $\beta \in \mathbb{R}$, and
subexponential rates $\bar r(t) \sim \exp(\alpha t^\beta)$, $\alpha > 0$,
$0 < \beta < 1$.

Throughout this paper, we will often make comparison with (discrete 
time) Markov chains; the unfamiliar reader can refer to [21]. 
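As a numerical illustration of the definition of subgeometric rates, the sketch below works with the logarithm of a rate (to avoid floating-point overflow) and checks on a grid that $\log r(t)/t$ is nonincreasing and becomes small. The function names, the parameter values, the grid and the tolerance are all assumptions made for this illustration; the logarithmic rate is written with $t + e$ rather than $t + 1$ so that it takes values in $[1, \infty)$. A grid check of this kind is only a sanity test, not a proof of membership in the class.

```python
import math

# Each function returns log r(t) for one family of rates.
def log_of_log_rate(t, beta=2.0):
    # r(t) = [log(t + e)]^beta
    return beta * math.log(math.log(t + math.e))

def log_of_poly_rate(t, alpha=1.5):
    # r(t) = (1 + t)^alpha
    return alpha * math.log1p(t)

def log_of_subexp_rate(t, a=1.0, beta=0.5):
    # r(t) = exp(a * t^beta), 0 < beta < 1
    return a * t ** beta

def log_of_geometric_rate(t, kappa=1.1):
    # r(t) = kappa^t, a geometric rate used here as a counterexample
    return t * math.log(kappa)

GRID = [10.0 ** k for k in range(0, 9)]  # t = 1, 10, ..., 1e8

def looks_subgeometric(log_rate, grid=GRID, tol=1e-3):
    """Grid check: r >= 1, log r(t)/t nonincreasing, and small at the last point."""
    ratios = [log_rate(t) / t for t in grid]
    nonincreasing = all(b <= a + 1e-12 for a, b in zip(ratios, ratios[1:]))
    return log_rate(grid[0]) >= 0.0 and nonincreasing and ratios[-1] < tol
```

With these choices the logarithmic, polynomial and subexponential families pass the check, while the geometric rate fails because $\log r(t)/t$ stays equal to $\log \kappa$.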

1.2. $(f, r)$-modulated moments and stability. Define the hitting-time on
a measurable set $C$, delayed by $\delta \ge 0$,

(2) $\tau_C(\delta) := \inf\{t \ge \delta, X_t \in C\};$

the hitting-time $\tau_C(0)$ is denoted by $\tau_C$. It is proved in the
literature that modulated moments of $\tau_C(\delta)$ for some closed petite set
are related to Harris-recurrence, positive Harris-recurrence, $f$-ergodicity
and geometric $f$-ergodicity. For a Borel function $f \ge 1$, an increasing
nonnegative rate function $r = (r(t))_{t \ge 0}$ and $\delta > 0$, define the
$(f, r)$-modulated moment

$$G_C(x, f, r; \delta) := \mathbb{E}_x\Big[\int_0^{\tau_C(\delta)} r(s) f(X_s)\,ds\Big].$$

R1. $X$ is Harris-recurrent if and only if there exists a petite set $C$ such
that, for all $x \in \mathcal{X}$, $\mathbb{P}_x(\tau_C < \infty) = 1$ ([20], Theorem 1.1).

R2. If $X$ is Harris-recurrent with invariant measure $\pi$, then for
$f \ge 1$, $\pi(f) < \infty$ if and only if
$\sup_{x \in C} G_C(x, f, 1; \delta) < \infty$ for some closed petite set $C$
([20], Theorem 1.2).

R3. A positive Harris-recurrent process is ergodic if and only if some
skeleton chain $P^m$ is irreducible ([22], Theorem 6.1).

R4. A positive Harris-recurrent process is $f$-ergodic if (a) some skeleton
chain $P^m$ is irreducible, (b) $\sup_{x \in C} G_C(x, h, 1; \delta) < \infty$,
where $h \ge \sup_{s \le m} P^s f$ and $C$ is a closed petite set, and (c) for all
$x$, $G_C(x, h, 1; \delta) < \infty$ ([20], Proposition 4.1 and [22],
Theorem 7.2).

R5. A positive Harris-recurrent process is geometrically $f$-ergodic if (a)
some skeleton chain $P^m$ is irreducible, (b) there exist a closed petite set
$C$ and $\eta > 0$ such that $G_C(x, h, \exp(\eta t); \delta)$ is finite for all
$x$, where $h \ge 1$ is a Borel function such that
$c_1 f \le \int_0^\infty \exp(-t) P^t h\,dt \le c_2 f$ for some finite positive
constants $c_i$, (c) $\sup_C G_C(\cdot, h, \exp(\eta t); \delta) < \infty$ ([5],
Theorem 7.4).

In Section 2 we give a criterion of the form R1-R5 that implies subgeometric
$f$-ergodicity.

To date, little is known about general characterizations of $f$-ergodicity
at a subgeometric rate for Markov processes. However, we note some important
special cases which have been studied in the literature. The work by Ganidis,
Roynette and Simonot [9] is restricted (a) to convergence in total variation
norm and (b) to diffusion processes on $\mathbb{R}^d$ with diffusion matrix equal
to the identity. Their proof is based on spectral properties of the
transition semigroup seen as an operator, and differs from the probabilistic
approach adopted in the present paper. We will see in Section 3.2 how to
improve their conclusions. Veretennikov [36] and Malyshkin [19] deal with
diffusion processes and can be read as a special application of the present
paper. The work most closely related to the present one is the paper by Dai
and Meyn [2], which considers $f$-ergodicity at a polynomial rate of a
Markovian state process in order to study the stability of multi-class
queueing networks. We will describe their results in Section 2.



1.3. Drift condition and generator. For a Borel function $0 \le V < \infty$,
denote by $\mathcal{A}V$ the Borel function, when it exists, such that
$t \mapsto \mathcal{A}V(X_t)$ is integrable $\mathbb{P}_x$-almost surely (a.s.), and
there exists an increasing sequence of stopping times $\{T_n\}_n$ such that,
for any stopping time $\tau$,

$$\mathbb{E}_x\Big[V(X_{\tau \wedge T_n}) - V(X_0) - \int_0^{\tau \wedge T_n} \mathcal{A}V(X_s)\,ds\Big] = 0 \quad \text{for all } x \in \mathcal{X},\ n \ge 0.$$

When $\mathcal{A}V$ exists, $V$ is said to be in the domain of $\mathcal{A}$. If there
exists $h$ such that $t \mapsto h(X_t)$ is integrable $\mathbb{P}_x$-a.s. and
$t \mapsto V(X_t) - V(X_0) - \int_0^t h(X_s)\,ds$ is a right-continuous
$\mathbb{P}_x$-local martingale (with respect to the filtration $\mathcal{F}_t$),
then $V$ is in the domain of $\mathcal{A}$ and $\mathcal{A}V = h$ [3]. If $V$ is in the
domain of the weak infinitesimal generator $\tilde{\mathcal{A}}$, then $V$ is in
the domain of $\mathcal{A}$ and $\mathcal{A}V = \tilde{\mathcal{A}}V$ [6]. If the
functions $V$ and $\tilde{\mathcal{A}}V$ are right-continuous, these two
sufficient conditions are equivalent and $\mathcal{A}V = \tilde{\mathcal{A}}V$.

When $\mathcal{A}V$ satisfies a drift condition $\mathcal{A}V \le -f + b 1_C$ for
some closed set $C$ and a nonnegative function $f$ such that
$t \mapsto f(X_t)$ is integrable $\mathbb{P}_x$-a.s., we have
$G_C(x, f, 1; \delta) \le V(x) + \delta b$; this will be the basic tool to
upper bound the $(f, r)$-modulated moments.

Conditions on $\mathcal{A}V$ are analogous to conditions on the variation
$P^m V - V$ for a discrete-time Markov chain with transition kernel $P^m$. It
is well known that the condition $P^m V - V \le -f$ outside a "test set" for
the skeleton $P^m$ is related (a) to the $f$-ergodicity of the Markov chain
$(X_{nm})_{n \ge 0}$ [21]; (b) to the geometric $V$-ergodicity if
$f = \lambda V$ for some $0 < \lambda < 1$ [21]; (c) to the polynomial
$f$-ergodicity if $f \propto V^{1-\alpha}$ for some $0 < \alpha < 1$ [8, 14];
(d) and more generally to subgeometric $f$-ergodicity if $f \propto \phi(V)$
for some concave function $\phi$ [4]. Similar results hold for a
continuous-time Markov process. Meyn and Tweedie [23] prove that the
condition $\mathcal{A}V \le -f$ outside a closed petite set is related (a) to the
$f$-ergodicity of the Markov process $X$; and (b) to the geometric
$V$-ergodicity if $f \propto V$ (see also [5, 26, 29]). In Section 2 we
establish that the case $f \propto V^{1-\alpha}$ is related to polynomial
$f$-ergodicity.
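The bound $G_C(x, f, 1; \delta) \le V(x) + \delta b$ quoted above can be sketched as follows. This is only an outline, under the additional assumption (made here for illustration) that the localizing sequence satisfies $T_n \uparrow \infty$ a.s.; the shorthand $\tau := \tau_C(\delta)$ and the explicit form of $G_C$ as an expected integral up to $\tau_C(\delta)$ are notation used for this sketch.

```latex
% Step 1: apply the defining identity of \mathcal{A}V with \tau = \tau_C(\delta),
% then use the drift condition \mathcal{A}V \le -f + b 1_C.
\mathbb{E}_x\big[V(X_{\tau \wedge T_n})\big]
  = V(x) + \mathbb{E}_x\Big[\int_0^{\tau \wedge T_n} \mathcal{A}V(X_s)\,ds\Big]
  \le V(x) - \mathbb{E}_x\Big[\int_0^{\tau \wedge T_n} f(X_s)\,ds\Big]
       + b\,\mathbb{E}_x\Big[\int_0^{\tau \wedge T_n} 1_C(X_s)\,ds\Big].

% Step 2: X_s \notin C for \delta \le s < \tau_C(\delta), so the process can
% spend at most \delta units of time in C before \tau_C(\delta).
\int_0^{\tau \wedge T_n} 1_C(X_s)\,ds \le \delta.

% Step 3: since V \ge 0, rearranging gives a bound uniform in n.
\mathbb{E}_x\Big[\int_0^{\tau \wedge T_n} f(X_s)\,ds\Big] \le V(x) + \delta b.

% Step 4: let n \to \infty and use monotone convergence (f \ge 0).
G_C(x, f, 1; \delta)
  = \mathbb{E}_x\Big[\int_0^{\tau_C(\delta)} f(X_s)\,ds\Big] \le V(x) + \delta b.
```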
2. Statements of the results. In Theorem 1, we establish that modulated
moments of some delayed hitting-time on a closed petite set $C$ provide a
criterion for subgeometric $f$-ergodicity. We assume that there exist
$\delta > 0$, a Borel function $f_* \ge 1$ and a rate function
$r_* \in \Lambda$ such that

(3) $\sup_C G_C(\cdot, f_*, 1; \delta) < \infty, \qquad \sup_C G_C(\cdot, 1, r_*; \delta) < \infty.$

We will establish that $r_*$ is the maximal rate of convergence (that can be
deduced from these assumptions) and that it is associated to convergence in
total variation norm, that is, in 1-norm, which is the minimal one. On the
other hand, we will show that $f_*$ is the largest norm in which convergence
occurs and that the associated convergence rate is the minimal one, 1.

Using an interpolation technique, we also derive a convergence rate
$1 \le r_f \le r_*$ in $\Lambda$ associated to some $f$-norm, $1 \le f \le f_*$ (see
[4] for a similar approach in the discrete-time case). The simplest
interpolation technique is given by Hölder's inequality, which yields [from
(3)] $\sup_C G_C(\cdot, f_*^p, r_*^{1-p}; \delta) < \infty$ for $0 < p < 1$. By
analogy to the discrete-time case, one would expect convergence in
$f_*^p$-norm at the rate $r_*^{1-p}$, and we will prove the continuous-time
version of this result.

More generally, if there exists a pair of nondecreasing positive functions
$(\Psi_1, \Psi_2)$ satisfying

(4) $\Psi_1(x)\Psi_2(y) \le x + y, \qquad x, y \ge 1,$

then $\sup_C G_C(\cdot, \Psi_2(f_*), \Psi_1(r_*); \delta) < \infty$. We will
establish that if $\Psi_1(r_*) \in \Lambda$, this condition yields convergence
in $\Psi_2(f_*)$-norm at the rate $\Psi_1(r_*)$. Young functions are closely
related to these pairs of functions $(\Psi_1, \Psi_2)$. Specifically, if
$(H_1, H_2)$ is a pair of Young functions, then $(H_1^{-1}, H_2^{-1})$
satisfies (4) (see, e.g., [16], Chapter 1). Let $\mathcal{I}$ be the set of pairs
of inverse Young functions, augmented with the pairs $(\mathrm{Id}, 1)$ and
$(1, \mathrm{Id})$. As commented above, $\mathcal{I}$ contains the pairs
$((x/p)^p, (y/(1-p))^{1-p})$, $0 < p < 1$, and, more generally, the pairs of
functions increasing at infinity as $(x^p \ln^b x, y^{1-p} \ln^{-b} y)$ for some
$0 < p < 1$ and $b \in \mathbb{R}$, $p = 0$ and $b > 0$, $p = 1$ and $b < 0$.
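The inequality (4) for the power pair $((x/p)^p, (y/(1-p))^{1-p})$ is an instance of the weighted arithmetic-geometric mean inequality, $a^p b^{1-p} \le p a + (1-p) b$ with $a = x/p$ and $b = y/(1-p)$. The sketch below checks it numerically on a grid; the function names, the grid and the tolerance are assumptions of this illustration.

```python
def psi1(x, p):
    """First component of the power pair: (x / p)^p."""
    return (x / p) ** p

def psi2(y, p):
    """Second component of the power pair: (y / (1 - p))^(1 - p)."""
    return (y / (1.0 - p)) ** (1.0 - p)

def max_violation(p, grid):
    """Largest value of psi1(x) * psi2(y) - (x + y) over the grid.

    A value <= 0 (up to rounding) means that inequality (4) holds there.
    """
    return max(psi1(x, p) * psi2(y, p) - (x + y) for x in grid for y in grid)

# Grid of points x, y >= 1 on which (4) is tested.
GRID = [1.0 + 0.5 * k for k in range(200)]
```

Equality in (4) is attained when $x/p = y/(1-p)$, so the reported maximum is expected to be zero up to floating-point rounding rather than strictly negative.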

Theorem 1. Let $f_* \ge 1$ be a Borel function and $r_* \in \Lambda$. Assume that:

(i) $X$ is Harris-recurrent with invariant measure $\pi$, and some skeleton
chain, say $P^m$, is $\psi$-irreducible.

(ii) There exist a closed petite set $C$ and some $\delta > 0$ such that (3) holds.

(iii) There exists a finite constant $c$ such that $\sup_{t \le m} P^t f_* \le c f_*$.

Then $\pi$ is an invariant probability measure, $\pi(f_*) < \infty$ and for any pair
$\Psi := (\Psi_1, \Psi_2) \in \mathcal{I}$,

$$\lim_{t \to +\infty} \{\Psi_1(r_*(t)) \vee 1\}\, \|P^t(x,\cdot) - \pi(\cdot)\|_{\Psi_2(f_*) \vee 1} = 0 \quad \text{for all } x \in S_\Psi,$$

where $S_\Psi$, which is of $\pi$-measure one, is defined by

$$S_\Psi := \Big\{x \in \mathcal{X},\ \mathbb{E}_x\Big[\int_0^{\tau_C(\delta)} \Psi_1(r_*(s))\, \Psi_2(f_*(X_s))\,ds\Big] < \infty\Big\}.$$

The proof of Theorem 1 is postponed to Section 4.1. We first verify that
$\Psi_1(r_*(t)) \vee 1 \in \Lambda$. Under (i) and (ii), $C$ is accessible and the
following lemma holds ([22], Proposition 6.1).

Lemma 2. Suppose that $X$ is positive Harris-recurrent with invariant
distribution $\pi$ and some skeleton chain $P^m$ is irreducible. For any
accessible petite set $C$, there exist $t_0 > 0$ and an irreducibility measure
$\nu$ for the process such that $\nu(C) > 0$ and
$\inf_{x \in C} \inf_{t \ge t_0} P^t(x, \cdot) \ge \nu(\cdot)$.

Based on this lemma and on condition (ii), the second step consists in
proving that the skeleton $P^m$ is irreducible, aperiodic and possesses a
petite set $A$ such that

(5) $\sup_A G_A^{(m)}(x, \Psi_2(f_*), \Psi_1(r_*)) < \infty \quad \text{with} \quad G_A^{(m)}(x, f, r) := \mathbb{E}_x\Big[\sum_{k=0}^{T_{m,A}-1} r(k) f(X_{km})\Big],$

where $T_{m,A} \ge 1$ is the return-time to $A$ for the skeleton chain $P^m$,

(6) $T_{m,A} := \inf\{k \ge 1, X_{km} \in A\}.$

By application of [35], Theorem 2.1, this proves that for $\pi$-a.a. $x$,
$\lim_{k \to +\infty} \Psi_1(r_*(k))\, \|P^{km}(x,\cdot) - \pi(\cdot)\|_{\Psi_2(f_*)} = 0$.
Using Theorem 1(iii), the limit still holds replacing $r_*(k)$ (resp.
$P^{km}$) by $r_*(t)$ (resp. $P^t$). We finally establish that the limit holds
for all the points $x \in S_\Psi$ and that $\pi(S_\Psi) = 1$.

Remark 3. Theorem 1 remains valid when condition (i) is replaced by the
following condition: there exists a $\psi$-irreducible, aperiodic and positive
recurrent transition kernel $P^m$.

Theorem 1 remains valid when (ii) and (iii) are replaced by the following
condition: there exist a closed petite set $C$ and some $\delta > 0$ such that
$\sup_C G_C(\cdot, h, 1; \delta) < \infty$ and
$\sup_C G_C(\cdot, 1, r_*; \delta) < \infty$, where $h \ge \sup_{t \le m} P^t f_*$.

Condition (iii) implies that the semigroup $(P^t)_{t \ge 0}$ and the resolvent
kernel $R = \int_0^\infty \exp(-t) P^t\,dt$ are bounded on $\mathcal{L}_{f_*}$.

Remark 4. By (4), it is readily seen that
$\{x;\ G_C(x, f_*, 1; \delta) + G_C(x, 1, r_*; \delta) < \infty\} \subset S_\Psi$.
It may be read from the proof that

(7) $\lim_{t \to +\infty} \{\Psi_1(r_*(t)) \vee 1\}\, \|\mu P^t(\cdot) - \pi(\cdot)\|_{\Psi_2(f_*) \vee 1} = 0,$

for all probability measures $\mu$ such that
$G_C(x, \Psi_2(f_*) \vee 1, \Psi_1(r_*) \vee 1; \delta)$ is $\mu$-integrable.
Applying (4) again, (7) holds for all distributions $\mu$ such that
$\{G_C(x, f_*, 1; \delta) + G_C(x, 1, r_*; \delta)\}$ is $\mu$-integrable.

Remark 5. For any pair $(\Psi_1, \Psi_2) \in \mathcal{I}$, if $\Psi_1$ strongly
increases at infinity [e.g., $\Psi_1(x) \propto x^p$ for some $p < 1$ close to
one], then $\Psi_2$ slowly increases [$\Psi_2(x) \propto x^{1-p}$ for some
$1 - p$ close to zero] ([16], Theorem 2.1, Chapter 1). Hence, the stronger the
norm, the weaker the rate (and conversely). This compromise between the rate
function and the norm of convergence is well known for discrete-parameter
Markov chains ([35]; see also [4, 8, 14]). As expected, this property remains
valid for continuous-time Markov processes.

Corollary 6 provides a condition based on A, well adapted to prove poly- 
nomial ergodicity. 

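Corollary 6. Let $1 \le V < \infty$ be a Borel function and $0 < \alpha < 1$.
Assume that:

(i) Some skeleton chain $P^m$ is irreducible.

(ii) There exists a closed petite set $C$ such that $\sup_C V < \infty$ and, for
all $\alpha \le \eta \le 1$, $t \mapsto V^{\eta-\alpha}(X_t)$ is integrable
$\mathbb{P}_x$-a.s. and

(8) $\mathcal{A}V^\eta \le -c_\eta V^{\eta-\alpha} + b 1_C, \qquad 0 \le b < \infty,\ 0 < c_\eta < \infty.$

Then there exists a unique invariant distribution $\pi$,
$\pi(V^{1-\alpha}) < \infty$ and, for all $0 < p < 1$ and $b \in \mathbb{R}$, or
$p = 1$ and $b \ge 0$, or $p = 0$ and $b \le 0$,

$$\lim_{t \to +\infty} (t+1)^{(1-p)(1-\alpha)/\alpha}\, [\ln(t+1)]^b\, \|P^t(x,\cdot) - \pi(\cdot)\|_{V^{(1-\alpha)p}(1+\ln V)^{-b}} = 0, \qquad x \in \mathcal{X}.$$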

The proof is given in Section 4.2. From (ii), we obtain
$G_C(x, V^{1-\alpha}, 1; \delta) + G_C(x, 1, (1+t)^{1/\alpha - 1}; \delta) \le cV(x)$;
and then we apply Theorem 1.

By choosing $b = 0$ and $p = (1 - \kappa\alpha)/(1 - \alpha)$ for some
$1 \le \kappa \le 1/\alpha$, Corollary 6 yields

(9) $\forall x \in \mathcal{X}, \quad \lim_{t \to \infty} (t+1)^{\kappa - 1}\, \|P^t(x,\cdot) - \pi(\cdot)\|_{V^{1-\kappa\alpha}} = 0.$

If (9) holds for some function $V$, we shall say that the Markov process is
polynomially ergodic with rate $(1+t)^{(1-\alpha)/\alpha}$.
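The trade-off between rate and norm behind (9) is simple arithmetic on exponents: with $b = 0$ and $p = (1-\kappa\alpha)/(1-\alpha)$, the rate exponent $(1-p)(1-\alpha)/\alpha$ equals $\kappa - 1$ and the norm exponent $(1-\alpha)p$ equals $1 - \kappa\alpha$. The helper below, whose name and interface are hypothetical and introduced only for this illustration, makes that computation explicit.

```python
def polynomial_tradeoff(alpha, kappa):
    """Exponents obtained from the choice b = 0, p = (1 - kappa*alpha)/(1 - alpha).

    Returns (p, rate_exponent, norm_exponent), where the rate is
    (1 + t)^rate_exponent and the norm is the V^norm_exponent-norm.
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie in (0, 1)")
    if not 1.0 <= kappa <= 1.0 / alpha + 1e-12:
        raise ValueError("kappa must lie in [1, 1/alpha]")
    p = (1.0 - kappa * alpha) / (1.0 - alpha)
    rate_exponent = (1.0 - p) * (1.0 - alpha) / alpha   # equals kappa - 1
    norm_exponent = (1.0 - alpha) * p                   # equals 1 - kappa*alpha
    return p, rate_exponent, norm_exponent
```

For instance, with $\alpha = 1/2$ the choice $\kappa = 1$ gives rate exponent 0 in the $V^{1/2}$-norm, while $\kappa = 2$ gives rate exponent 1 in the norm with exponent 0 (total variation): a stronger norm is paid for with a slower rate.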

Remark 7. Corollary 6 can be compared to the paper by Jarner and
Roberts [14] for the discrete-parameter case. They start by proving that
if there exist a Borel function $1 \le V < \infty$, $0 < \alpha < 1$ and a set $C$
such that, for all $\alpha \le \eta \le 1$,

(10) $P^m V^\eta - V^\eta \le -c_\eta V^{\eta-\alpha} + b 1_C, \qquad 0 \le b < \infty,\ 0 < c_\eta < \infty,$

there exists $c < \infty$ such that
$G_C^{(m)}(\cdot, V^{1-\alpha}, 1) + G_C^{(m)}(\cdot, 1, (1+k)^{1/\alpha - 1}) \le cV$,
where $G_C^{(m)}$ is given by (5). The drift condition (10) is analogous to (8)
and the controls of the moments $G_C^{(m)}$ and $G_C$ are similar. If, in
addition, $P^m$ is irreducible, aperiodic and $C$ is petite for the skeleton,
$P^m$ is positive with invariant distribution $\pi$ such that
$\pi(V^{1-\alpha}) < \infty$ and, for all $1 \le \kappa \le 1/\alpha$, the skeleton is
$V^{1-\kappa\alpha}$-ergodic with rate $(n+1)^{\kappa-1}$. These rates coincide
with those in (9).

Remark 8. From the proof of Corollary 6, it may be read that only a
finite number of nested drift conditions is required; nevertheless, in
practice, it is no more restrictive to verify a continuum of drift conditions
than to verify a finite number of drift conditions. More precisely,
assumption (ii) can be replaced by the following conditions: (iii) there exist
a closed petite set $C$ and functions $1 \le V_{q-1} \le c f_q$ such that, for
all integers $1 \le q \le p$, $\mathcal{A}V_q \le -f_q + b 1_C$,
$t \mapsto f_q(X_t)$ is integrable $\mathbb{P}_x$-a.s., and $\sup_C V_p < \infty$;
(iv) there exists $\beta > 0$ such that $\mathbb{E}_x[\tau_C^\beta] \le f_1(x)$. In
that case, following the same lines as in the proof of Proposition 26, it may
be proved that
$G_C(\cdot, 1, (t+1)^{p-1+\beta}; \delta) + G_C(\cdot, f_p, 1; \delta) \le cV_p$
for some $\delta > 0$. Together with condition (i), this yields
$f_*^\eta$-ergodicity at a rate $(t+1)^{(p-1+\beta)(1-\eta)}$ for all
$0 \le \eta \le 1$, where $f_*$ is any function satisfying
$\sup_{t \le m} P^t f_* \le f_p$.

Dai and Meyn [2] are, to the best of our knowledge, the first to exhibit this
kind of nested drift condition and, hence, the first to address ergodicity at
a polynomial rate; they proved that this yields $f_1$-ergodicity at a rate
$(t+1)^{p-1}$ (Theorem 6.3, [2]). We are able to obtain the same result: to
that end, we observe that conditions (iii) and (iv) are verified with
functions $f_k \le f_p^{k/p}$, $\beta = 1$ (as a consequence of Proposition 5.3
and equation (6.1) in [2]) and $f_* \propto f_p$.

We proved that nested drift conditions on the generator $\mathcal{A}$ provide a
control of moments $G_C$ with a polynomially increasing rate function. The
converse seems to be an open question. We nevertheless mention Propositions
5.4 and 6.1 in [2], which provide a (partial) converse condition: from the
condition $\sup_C G_C(\cdot, f, 1; \delta) < \infty$, they deduce a drift
condition on $\mathcal{A}$ (we point out that this single condition implies a
continuum of conditions by using the same convexity argument as in [14],
Lemma 3.5). Unfortunately, this drift condition, in turn, implies only a
control of the moment $G_C(x, Rf, 1; \delta)$, where $Rf(x)$ is a function
which is, in general, difficult to compare with $f$.
3. Examples. In this section $\langle \cdot, \cdot \rangle$ and $|\cdot|$ denote,
respectively, the scalar product and the Euclidean norm in $\mathbb{R}^n$. If $u$
is a twice continuously differentiable real-valued function on $\mathbb{R}^n$,
$\nabla u$ (resp. $\nabla^2 u$) denotes its gradient (resp. its Hessian
matrix), and $\partial u/\partial x_i$ its partial derivative with respect to
the $i$th variable. For a matrix $a$, $\mathrm{Tr}(a)$ stands for the matrix
trace and $a'$ for the matrix transpose. For $r \in \Lambda$, define the
function $r^0$ by $r^0(t) = \int_0^t r(s)\,ds$. Finally, we largely make use of
the inequality $r(s+t) \le r(s)r(t)$, $s, t \ge 0$, which holds for any rate
$r \in \Lambda$ ([34], Lemma 1).
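The submultiplicativity inequality $r(s+t) \le r(s)r(t)$ can be checked numerically for a few typical rates. The three rates below, their parameter values, the grid and the helper name are assumptions made for this illustration; the logarithmic rate uses $t + e$ so that $r(0) = 1$.

```python
import math

def log_rate(t, beta=2.0):
    # r(t) = [log(t + e)]^beta
    return math.log(t + math.e) ** beta

def poly_rate(t, alpha=1.5):
    # r(t) = (1 + t)^alpha
    return (1.0 + t) ** alpha

def subexp_rate(t, a=1.0, beta=0.5):
    # r(t) = exp(a * t^beta), 0 < beta < 1
    return math.exp(a * t ** beta)

def is_submultiplicative(rate, grid, rel_tol=1e-12):
    """Check r(s + t) <= r(s) * r(t) for all s, t in the grid (up to rounding)."""
    return all(
        rate(s + t) <= rate(s) * rate(t) * (1.0 + rel_tol)
        for s in grid
        for t in grid
    )

# Moderate grid so that exp(a * t^beta) stays far from floating-point overflow.
GRID = [0.25 * k for k in range(0, 200)]
```

By contrast, a function such as $r(t) = 1 + t^2$ fails the check (take $s = t$ small), which is why the inequality is tied to the specific class of rates considered here.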

3.1. Toy example: jump process. Consider the jump process on $\mathbb{Z}_+$ such
that, given that $X_t = i$, the waiting-time to the next jump has an
exponential distribution with expectation $\lambda_i^{-1}$ and is independent
of the past history. We assume that for all $i \ge 0$, $\lambda_i > 0$, and
$\sup_{i \ge 0} \lambda_i < \infty$. The probability that the jump leads to state
$j$ is given by the matrix entry $Q(i,j)$. We consider the case when
$Q(0,i) = p_i$ and $Q(i,0) = 1$ for all $i \ge 1$, for some positive sequence
$(p_i)_{i \ge 1}$ such that $\sum_{i \ge 1} p_i = 1$. We assume, in addition, that

(11) $\liminf_i \lambda_i = 0 \quad \text{and} \quad \sum_{i \ge 1} p_i \lambda_i^{-1} < \infty.$

Since $\sup_{i \ge 0} \lambda_i < \infty$, there exists a $\mathbb{Z}_+$-valued
right-continuous strong Markov process satisfying the heuristic description
above and such that, for all $i, j \in \mathbb{Z}_+$, the limit

(12) $\lim_{t \downarrow 0} \frac{P^t(i,j) - \delta_i(j)}{t} =: A(i,j) < \infty$

exists, where $\delta_i$ is the Dirac mass at point $i$, and for all $i \ge 1$,

(13) $A(0,0) = -\lambda_0, \qquad A(0,i) = \lambda_0 p_i, \qquad A(i,0) = -A(i,i) = \lambda_i,$

and $A(i,j) = 0$ otherwise (see, e.g., [7], page 330).

Lemma 9. The process is Harris-recurrent, reversible with invariant
distribution $\pi$ given by
$\pi(0) = \{1 + \lambda_0 \sum_{j \ge 1} p_j \lambda_j^{-1}\}^{-1}$ and
$\pi(i) = \lambda_0 p_i \lambda_i^{-1} \pi(0)$, $i \ge 1$. Any skeleton chain
$P^m$ is irreducible.

Proof. We have $\mathbb{E}_i[\tau_0] = (1 - 1_0(i))\lambda_i^{-1}$ and, for all
$i \ge 0$, $j \ne i$, $\delta > 0$, $P^\delta(i,j) \le p_j$. Then
$\mathbb{E}_i[\tau_0(\delta)] = \delta + \sum_{j \ge 1} P^\delta(i,j)\, \mathbb{E}_j[\tau_0] \le \delta + 2 \sum_{j \ge 1} p_j \lambda_j^{-1}$.
Hence, for all $i \in \mathbb{Z}_+$, $\mathbb{P}_i(\tau_0(\delta) < \infty) = 1$ and, as
$\{0\}$ is a closed petite set, the process is Harris-recurrent, $\pi$ is the
unique invariant probability measure (as the unique solution to $\pi A = 0$),
and since $X$ obeys the detailed balance, that is,
$\pi(i)A(i,j) = \pi(j)A(j,i)$ for all $i, j$, the process is reversible.
Finally, for all $m > 0$ and $i, j > 0$,

$$P^m(i,j) \ge p_j \lambda_0 \lambda_i \lambda_j \int_0^m ds\, \exp(-\lambda_i s) \int_0^{m-s} dt\, \exp(-\lambda_0 t) \int_{m-(t+s)}^{\infty} du\, \exp(-\lambda_j u) > 0,$$

where the inequality says that $P^m(i,j)$ is greater than the probability of
a single visit to 0 before a jump to $j$. Similarly, it is easy to prove that
$P^m(0,j) > 0$ and $P^m(j,0) > 0$ for any $j \in \mathbb{Z}_+$. This proves the
irreducibility of any skeleton. □
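The invariance and reversibility claims can be checked by hand on a finite-state instance of the toy model. The sketch below builds the generator entries described in (13) for states $0, \ldots, N$ and verifies $\pi A = 0$ and detailed balance. The truncation to finitely many states, the particular values of $\lambda_i$ and $p_i$, and the function names are assumptions of this illustration; the factor $\lambda_0$ in the weights comes from the detailed balance relation $\pi(0)A(0,i) = \pi(i)A(i,0)$.

```python
def jump_generator(lam, p):
    """Generator matrix: A(0,0) = -lam[0], A(0,i) = lam[0]*p[i],
    A(i,0) = -A(i,i) = lam[i] for i >= 1, and 0 otherwise.

    lam[i] > 0 for all i; p[0] is unused and sum(p[1:]) must equal 1.
    """
    n = len(lam)
    A = [[0.0] * n for _ in range(n)]
    A[0][0] = -lam[0]
    for i in range(1, n):
        A[0][i] = lam[0] * p[i]
        A[i][0] = lam[i]
        A[i][i] = -lam[i]
    return A

def invariant_distribution(lam, p):
    """pi(i) proportional to lam[0]*p[i]/lam[i] for i >= 1, and to 1 for i = 0."""
    weights = [1.0] + [lam[0] * p[i] / lam[i] for i in range(1, len(lam))]
    total = sum(weights)
    return [w / total for w in weights]

def stationarity_residual(pi, A):
    """Max absolute entry of the row vector pi * A (zero for an invariant pi)."""
    n = len(pi)
    return max(abs(sum(pi[i] * A[i][j] for i in range(n))) for j in range(n))

# Illustrative instance: slow states far from 0, geometric jump probabilities.
N = 12
LAM = [0.7] + [1.0 / (i * i) for i in range(1, N + 1)]
RAW = [0.0] + [2.0 ** (-i) for i in range(1, N + 1)]
P = [w / sum(RAW) for w in RAW]
```

With these inputs, `stationarity_residual(invariant_distribution(LAM, P), jump_generator(LAM, P))` is zero up to rounding.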

We deduce from R3 that the process is ergodic. Nevertheless, this conver- 
gence fails to occur at a geometric rate as shown in Lemma 10, the proof of 
which relies on the notion of conductance. 



Lemma 10. $X$ fails to be geometrically ergodic.

Proof. As $X$ is reversible, any Markov kernel $P^m$ is reversible. It is
proved in [17] that for a reversible Markov kernel $P^m$, the conductance
$\kappa_m$ given by $\kappa_m := \inf_A c_m(A)$, where
$c_m(A) := \{\pi(A)\pi(A^c)\}^{-1} \int_A P^m(x, A^c)\,\pi(dx)$, is positive if
and only if $P^m$ is geometrically ergodic. We verify that for any skeleton
$P^m$ the conductance is zero, which implies that the skeleton fails to be
geometrically ergodic. Consider the set of states $i$ such that
$\pi(i) < 1/2$. Then $c_m(\{i\}) \le 2(1 - P^m(i,i)) \le 2(1 - \exp(-\lambda_i m))$,
upon noting that $P^m(i,i)$ is lower bounded by the probability that the
waiting-time in state $i$ is greater than $m$. Since
$\liminf_{i \to +\infty} \lambda_i = 0$, for all $\epsilon > 0$ there exists a state
$i$ such that $c_m(\{i\}) \le \epsilon$, which implies $\kappa_m = 0$. □

We now identify functions V that are in the domain of A. 

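Lemma 11. Let $0 \le V < \infty$ be a Borel function such that
$\sum_{i \ge 1} p_i V(i) < \infty$. Then $V$ is in the domain of $\mathcal{A}$ and
$\mathcal{A}V = AV$.

Proof. For a function $f \ge 0$ such that $\sum_j p_j f(j) < \infty$, the monotone
convergence theorem yields

$$\lim_{t \downarrow 0} \frac{P^t f(i) - f(i)}{t} = \sum_j \lim_{t \downarrow 0} \frac{P^t(i,j) - \delta_i(j)}{t}\, f(j) = Af(i);$$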

in addition, sup^ + / |A/| < cxd. This proves that V is in the domain of the 
weak infinitesimal generator A, and thus in the domain of A. □ 

The expression of the generator suggests that a function $V$ of the form
$V(i) \propto \lambda_i^{-\beta}$ is a candidate to solve the drift inequality
(8). This yields $f$-ergodicity at a log-polynomial rate.

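Proposition 12. Assume that there exists $\beta \ge 1$ such that
$\sum_{i \ge 1} p_i \lambda_i^{-\beta} < \infty$. Then for all $i \in \mathbb{Z}_+$,
$0 < \kappa < \beta - 1$ and $b \in \mathbb{R}$, or $\kappa = 0$ and $b \le 0$, or
$\kappa = \beta - 1$ and $b \ge 0$,

$$\lim_{t \to +\infty} (1+t)^{\beta-\kappa-1}\, [\ln(1+t)]^b\, \|P^t(i,\cdot) - \pi(\cdot)\|_{1 \vee \lambda^{-\kappa}[1+\ln(1 \vee \lambda^{-1})]^{-b}} = 0.$$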
Proof. We apply Corollary 6: we choose $V \ge 1$ such that for all $i \ge 1$,
= c~ V(0)Ar /3 for some c > ^(0). Then (8) is verified with a = j3 1 and 
the closed petite set C = {0}. □ 

When $\beta = 1$ [i.e., with nothing more than the condition (11)], this
establishes the convergence in total variation norm at the rate 1, which
corroborates the ergodicity of the process proved above. Nevertheless, if for
some $\beta > 0$ the sum
$\sum_{i \ge 1} p_i (1 \vee \lambda_i^{-1}) [\log(1 \vee \lambda_i^{-1})]^\beta$
exists, Corollary 6 does not yield a stronger convergence result than the
ergodic one. We prove, by application of Theorem 1, which covers more general
rates than the polynomial ones, that convergence in total variation norm
occurs at the rate $r_*(t) \sim [\log(t)]^\beta$, and that convergence in the
norm $f_*(x) = [\log(1 \vee \lambda_x^{-1}) + 1]^\beta$ occurs at rate 1. We
also derive sufficient conditions for subexponential ergodicity.

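Lemma 13. Let $f_* : \mathbb{Z}_+ \to [1, \infty)$ and $r_* \in \Lambda$ be such that

(14) $\sum_{j \ge 1} p_j (1 \vee \lambda_j^{-1}) f_*(j) < \infty \quad \text{and} \quad \sum_{j \ge 1} p_j \lambda_j^{-1} \int_0^{+\infty} r_*(s) \lambda_j \exp(-\lambda_j s)\,ds < \infty.$

Then there exists a finite constant $c$ such that for all $m > 0$,
$\sup_{t \le m} P^t f_* \le c f_*$. For all $\delta > 0$, there exists a finite
constant $c$ such that

$$G_0(x, f_*, 1; \delta) \le c (1 \vee \lambda_x^{-1}) f_*(x),$$

$$G_0(x, 1, r_*; \delta) \le c \int_0^{+\infty} r_*(s) \exp(-\lambda_x s)\,ds.$$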

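Proof. Since $P^t(x,j) \le p_j$ for all $x \ne j$, it is trivial to prove that
$\sup_{t \ge 0} \sup_{i \in \mathbb{Z}_+} f_*^{-1}(i) P^t f_*(i) < \infty$. For $f \ge 1$
and $r \in \Lambda$,

$$G_0(x, f, r; \delta) \le \int_0^\delta r(s) P^s f(x)\,ds + r(\delta) \sum_j P^\delta(x,j) f(j)\, \mathbb{E}_j[r^0(\tau_0)].$$

Moreover,
$\mathbb{E}_j[r^0(\tau_0)] = \lambda_j \int r^0(t) \exp(-\lambda_j t)\,dt = \int r(t) \exp(-\lambda_j t)\,dt$. □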

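Proposition 14. (i) Assume that
$\sum_{i \ge 1} p_i (1 \vee \lambda_i^{-1}) [\log(1 \vee \lambda_i^{-1})]^\beta < \infty$
for some $\beta > 0$. For all $0 \le \kappa \le \beta$, $i \in \mathbb{Z}_+$,
$\lim_{t \to +\infty} [\log(t+1)]^{\beta-\kappa}\, \|P^t(i,\cdot) - \pi(\cdot)\|_{f_*^{\kappa/\beta}} = 0$,
where $f_* = 1 + [\log(1 \vee \lambda^{-1})]^\beta$.

(ii) Assume that
$\sum_{i \ge 1} p_i (1 \vee \lambda_i^{-1}) \lambda_i^{-1/2} \exp(z^2 \lambda_i^{-1}) < \infty$
for some $z > 0$. For all $0 < p < 1$, $i \in \mathbb{Z}_+$,
$\lim_{t \to +\infty} \exp(2z(1-p)t^{1/2})\, \|P^t(i,\cdot) - \pi(\cdot)\|_{[1 + \lambda^{-1/2} \exp(z^2 \lambda^{-1})]^p} = 0$.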
Proof. In both cases, apply Theorem 1; for case (i), set
$r_*(t) = \{\log(\exp(\beta - 1) + t)\}^\beta$ and
$f_*(i) = 1 + [\log(1 \vee \lambda_i^{-1})]^\beta$; and for case (ii), set
$r_*(t) = \exp(2zt^{1/2})$, $f_*(i) = 1 + \lambda_i^{-1/2} \exp(z^2 \lambda_i^{-1})$
and observe that
$\int_0^{+\infty} \exp(2zs^{1/2}) \lambda \exp(-\lambda s)\,ds \le 1 + 2\sqrt{\pi}\, z \lambda^{-1/2} \exp(z^2 \lambda^{-1})$. □
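The integral bound used for case (ii) can be verified directly. Substituting $s = u^2$ and completing the square gives the closed form $1 + z\sqrt{\pi/\lambda}\,\exp(z^2/\lambda)\,(1 + \mathrm{erf}(z/\sqrt{\lambda}))$, and bounding $\mathrm{erf}$ by 1 gives the stated upper bound $1 + 2\sqrt{\pi}\,z\lambda^{-1/2}\exp(z^2/\lambda)$. The sketch below compares a midpoint-rule approximation with both expressions; the function names, the truncation point and the number of quadrature nodes are assumptions of this illustration.

```python
import math

def integral_closed_form(z, lam):
    """Closed form of the integral of exp(2 z sqrt(s)) * lam * exp(-lam s) over s >= 0."""
    return 1.0 + z * math.sqrt(math.pi / lam) * math.exp(z * z / lam) * (
        1.0 + math.erf(z / math.sqrt(lam))
    )

def integral_numeric(z, lam, n=20000):
    """Midpoint rule after the substitution s = u^2, truncated where the
    Gaussian-type integrand is negligible."""
    upper = z / lam + 12.0 / math.sqrt(lam)
    h = upper / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * h
        total += 2.0 * u * lam * math.exp(2.0 * z * u - lam * u * u)
    return total * h

def integral_upper_bound(z, lam):
    """The bound 1 + 2 sqrt(pi) z lam^(-1/2) exp(z^2 / lam)."""
    return 1.0 + 2.0 * math.sqrt(math.pi) * z * math.exp(z * z / lam) / math.sqrt(lam)
```

The gap between the closed form and the bound is the Gaussian tail term, so the bound is tight up to a factor of at most two in its leading part.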

3.2. Langevin tempered diffusions on $\mathbb{R}^n$. Let us consider a stochastic
integral equation

(15) $X_t = X_0 + \int_0^t b(X_s)\,ds + \int_0^t \sigma(X_s)\,dW_s,$

where $W_t$ is an $n$-dimensional Brownian motion and the drift coefficient
$b = (b_1, \ldots, b_n)'$ is of the form, for $1 \le i \le n$,

$$b_i(x) = \frac{1}{2} \sum_{j=1}^n a_{ij}(x) \frac{\partial}{\partial x_j} \log \pi(x) + \frac{1}{2} \sum_{j=1}^n \frac{\partial}{\partial x_j} a_{ij}(x),$$

where $a = \sigma\sigma'$ is an $n \times n$ symmetric positive definite matrix.
Such a diffusion is the so-called Langevin diffusion and is defined in such a
way that $\pi$ is, up to a multiplicative constant, the density of the unique
invariant probability distribution (with respect to the Lebesgue measure on
$\mathbb{R}^n$). This property motivates recent interest in Langevin diffusions
for their use in MCMC methods, whose aim is to draw samples from a Markov
chain with given stationary density $\pi$. The efficiency of these algorithms
is linked to the rate at which $f$-moments $\mathbb{E}_x[f(X_t)]$ converge to the
constant $\pi(f)$. This motivates the study of $f$-ergodicity. In practice,
discretizations of the continuous-time process are used to solve the MCMC
simulation problem, and recent works proved that it is possible to find
discretization methods which inherit the convergence rates of the
continuous-time diffusion (see [27, 28, 30, 32, 33] for discretization methods
and their use in MCMC techniques). Roberts and Tweedie proved that, on the
real line, when the target density $\pi$ is heavy tailed, the Langevin
diffusion with $\sigma := 1$ cannot be geometrically ergodic. We complement
this assertion when $\pi$ is polynomial in the tails, and prove that the
Langevin diffusion, in the one-dimensional case as well as in the
multidimensional one, is $f$-ergodic at a polynomial rate. For such a
polynomial target density on the real line, it was observed in [13] that the
polynomial rate of convergence of the Metropolis-Hastings algorithm could be
improved by choosing a heavy-tailed proposal distribution. This idea, when
adapted to the diffusion on the real line, suggests the choice of a speed
measure, that is, of the coefficient $\sigma$ such that $\sigma$ is small when
the process is close to the modes of $\pi$ and big when far from the modes
[32]. In the multidimensional case, this suggests $a(x)$ of the form
$\pi^{-2d}(x) I_n$, where $I_n$ is the identity matrix on $\mathbb{R}^n$ and
$d \ge 0$. When $d > 0$, we call these processes Langevin tempered diffusions
(see [27] for the justification of these heated diffusions). It was observed
in the literature that by choosing $d$ large enough, a diffusion on the real
line with target density polynomial in the tails is geometrically ergodic. We
investigate the behavior of this Langevin tempered diffusion in the
multidimensional case, in contrast to most of the mentioned contributions,
which cover the one-dimensional case. In Theorem 16, it is proved that, up to
some critical temperature $d_*$, the diffusion is polynomially ergodic and
that the larger $d$, the better the rate. When $d > d_*$, the diffusion is
geometrically ergodic. We henceforth consider a diffusion matrix
$a(x) = \sigma^2(x) I_n$, where $\sigma(x) := \pi^{-d}(x)$ for some $d \ge 0$.
Assume the following:

Al. 7r is, up to a multiplicative constant, a positive and twice continuously 
differentiable density on M n (with respect to the Lebesgue measure). 

Define the drift vector

(16) $b(x) := \frac{1-2d}{2}\,\pi^{-2d}(x)\,\nabla\log\pi(x).$

Under A1, the coefficients $b$ and $\sigma$ are locally Lipschitz-continuous, which
implies that for any compact set $\mathcal{K}$, $\sup_{x\in\mathcal{K}}\{|b(x)| + |\sigma(x)|\}(1+|x|)^{-1} < \infty$.
These local conditions allow the construction of a continuous process
satisfying the stochastic integral equation (15) up to the explosion time
$\zeta := \lim_{n\to\infty}\zeta_n$, where $\zeta_n := \inf\{t \ge 0, |X_t| \ge n\}$. We thus formulate the following
assumption:

A2. The process is regular, that is, $\zeta = +\infty$ a.s.
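
As an illustration only, the following sketch discretizes a one-dimensional tempered Langevin diffusion with an Euler-Maruyama scheme. It assumes the drift $b = \frac{1-2d}{2}\pi^{-2d}\nabla\log\pi$ associated with $\sigma = \pi^{-d}$, and a hypothetical Student-type target $\pi(x) \propto (1+x^2)^{-1/(2\beta)}$, which decays like $|x|^{-1/\beta}$. The function names, the target and the step size are assumptions of this sketch and are not taken from the paper.

```python
import math
import random

def log_pi(x, beta):
    # Hypothetical unnormalized target: pi(x) = (1 + x^2)^(-1/(2*beta)),
    # which decays like |x|^(-1/beta) in the tails.
    return -math.log1p(x * x) / (2.0 * beta)

def grad_log_pi(x, beta):
    # Derivative of log_pi with respect to x.
    return -x / (beta * (1.0 + x * x))

def sigma(x, beta, d):
    # Diffusion coefficient sigma(x) = pi(x)^(-d).
    return math.exp(-d * log_pi(x, beta))

def drift(x, beta, d):
    # Assumed drift: b(x) = (1 - 2d)/2 * sigma(x)^2 * grad log pi(x).
    return 0.5 * (1.0 - 2.0 * d) * sigma(x, beta, d) ** 2 * grad_log_pi(x, beta)

def euler_maruyama(x0, beta, d, h, n_steps, rng):
    # Plain Euler-Maruyama discretization with step size h.
    x = x0
    path = [x]
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0)
        x = x + drift(x, beta, d) * h + sigma(x, beta, d) * math.sqrt(h) * noise
        path.append(x)
    return path
```

A call such as `euler_maruyama(1.0, 0.5, 0.0, 0.01, 1000, random.Random(0))` returns a discretized path for the untempered case $d = 0$. For $d > 0$ the coefficients grow in the tails, so a plain Euler scheme may need very small steps; the scheme is a numerical convenience and says nothing about regularity of the continuous process.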

Under A1, a sufficient condition for regularity is the existence of a twice
continuously differentiable nonnegative function $V$ and a constant $c > 0$ such
that $LV \le cV$ on $\mathbb{R}^n$ and $\lim_{n\to\infty}\inf_{|x|\ge n} V(x) = +\infty$ ([12], Theorem 3.4.1),
where $L$ is the elliptic operator

$LV(x) := \langle b(x), \nabla V(x)\rangle + \frac{1}{2}\sigma^2(x)\,\Delta V(x).$

In the one-dimensional case, Has'minskii ([12], Remark 2, page 105) establishes
that the process is regular if $d$ is chosen such that

(17) $\int_0^{+\infty}\pi^{2d-1}(x)\,dx = \int_{-\infty}^{0}\pi^{2d-1}(x)\,dx = +\infty,$

since the function $V(x) := \mathrm{sign}(x)\int_0^x Q(y)\,dy$, where
$\ln Q(x) = -2\int_0^x b(t)\sigma^{-2}(t)\,dt = (2d-1)(\ln\pi(x) - \ln\pi(0))$, is finite and satisfies $LV = 0$ on $\mathbb{R}$. To
cover the multidimensional case, we adapt this condition and claim that the
process is regular if $d$ is chosen such that

(18) $\int_r^{+\infty} t^{1-n}\exp\Big(-(1-2d)\int_r^t s^{-1}\sup_{\{x,|x|=s\}}\langle\nabla\log\pi(x),x\rangle\,ds\Big)\,dt = +\infty.$

Indeed, the function $V(x) := U(|x|)$ where, for all $u > 0$,

$U(u) := \int_r^u \exp\Big(-\int_r^t\Big[\sup_{\{x,|x|=s\}}\Big\langle\frac{2b(x)}{\sigma^2(x)},\frac{x}{|x|}\Big\rangle + \frac{n-1}{s}\Big]ds\Big)\,dt,$

is finite and satisfies $LV = 0$ on $\mathbb{R}^n$. In the one-dimensional case, condition
(17) is necessary for the existence of an invariant probability measure ([12],
Remark 2, page 105); thus, for the objective of this paper, $d$ has to be chosen
in the set $\mathcal{D}_1$ of the nonnegative real numbers such that (17) holds. Observe
that $\mathcal{D}_1$ is nonempty and contains {0, 1/2}. In the multidimensional case,
a necessary condition for (positive) recurrence is that $d$ satisfies a condition
of the form (18) where the supremum is replaced by the infimum ([11],
Theorem II, page 194). This defines an interval $\mathcal{D}_n$ limiting
the range of the possible temperatures $d$.

Under A1 and A2, there exists a solution $(\Omega, \mathcal{F}, (\mathcal{F}_t), (W_t), (X_t), \mathbb{P})$, where
$(\Omega, \mathcal{F}, (\mathcal{F}_t), (W_t), \mathbb{P})$ is an $n$-dimensional Brownian motion and $(X_t)_t$ is an $\mathcal{F}_t$-adapted
homogeneous and continuous Markov process with Feller transition probability,
satisfying (15) $\mathbb{P}$-a.s. and such that both integrals exist, that is, for
all $t \ge 0$,

(19) $\mathbb{P}\Big(\int_0^t |b|(X_s)\,ds + \int_0^t \sigma^2(X_s)\,ds < \infty\Big) = 1.$

A transition semigroup $(P^t)_{t\ge0}$ has the Feller property if, for any continuous
bounded real-valued function $f$, $x \mapsto P^t f(x)$ is continuous. $(X_t)$ is thus
a strongly Markovian process, as a (right-)continuous process with Feller
transition probability [6].

Let $0 \le V < \infty$ be a twice continuously differentiable function such that
there exist a nonnegative Borel function $\varphi$, bounded on compact sets, a
constant $b < \infty$ and a compact set $C$ such that $LV \le -\varphi\mathbf{1}_{C^c} + b\mathbf{1}_C$. From (19)
and the continuity of $t \mapsto \nabla V(X_t)$, the process $t \mapsto \int_0^t \sigma(X_s)\{\nabla V(X_s)\}'\,dW_s$
is a local martingale. Application of Itô's rule yields $LV = \mathcal{A}V$.

A3. For all $1 \le i,j \le n$, $\partial^2\sigma^2(x)/\partial x_i\,\partial x_j$ and $\partial^2\log\pi(x)/\partial x_i\,\partial x_j$ are locally
uniformly Hölder continuous.

Proposition 15. Under A1-A3, the process is reversible and $\pi$ is, up
to a multiplicative constant, the density of an invariant probability measure.
Any skeleton chain is irreducible, and compact sets are closed petite sets.

Proof. There exists a continuous function $p : (t,x,y) \mapsto p(t,x,y)$ such
that $P^t(x,dy) = p(t,x,y)\,dy$ ([15], Theorem 1.1). Since the process is regular
(or conservative, in the terminology of Kent [15]) and $\pi$ is Lebesgue
integrable, [15], Theorems 4.1 and 6.2, imply that the process is time-reversible
and

(20) $\lim_{t\to\infty}\int_A p(t,x,y)\,dy = \Big(\int_A \pi(y)\,dy\Big)\Big/\int\pi(y)\,dy.$

Hence, $\pi(dx)$ is invariant. Irreducibility of skeletons results from (20), and
petiteness of compact sets from the continuity of $p(t,\cdot,\cdot)$. □

Finally, we restrict our attention to densities $\pi$ that are polynomially
decreasing in the tails.

A4. $\pi$ satisfies A1 and A3 and there exists some $0 < \beta < 1/n$ such that

$0 < \liminf_{|x|\to+\infty}\frac{|\nabla\log\pi(x)|}{\pi^{\beta}(x)} \le \limsup_{|x|\to+\infty}\frac{|\nabla\log\pi(x)|}{\pi^{\beta}(x)} < \infty,$

$2\beta - 1 < \liminf_{|x|\to+\infty}\frac{\mathrm{Tr}(\nabla^2\log\pi(x))}{|\nabla\log\pi(x)|^2} \le \limsup_{|x|\to+\infty}\frac{\mathrm{Tr}(\nabla^2\log\pi(x))}{|\nabla\log\pi(x)|^2} < \infty.$

Set $\gamma := \liminf_{|x|\to+\infty}\mathrm{Tr}(\nabla^2\log\pi(x))\,|\nabla\log\pi(x)|^{-2}$.

This class is nonempty and contains the densities that are polynomially
decreasing in the tails, $\pi(x) = c|x|^{-1/\beta}$ for large $|x|$, where $0 < \beta < 1/n$;
in that case, $\gamma = \beta(2-n) > 2\beta - 1$. For this family, the regularity
criterion (17) or (18) says that the temperature $d$ has to be chosen in
$\mathcal{D}_n = [0, (1+\beta(2-n))/2]$. For any density in the class A4,
$0 < \liminf_{|x|\to\infty}|x|^{1/\beta}\pi(x) \le \limsup_{|x|\to\infty}|x|^{1/\beta}\pi(x) < \infty$.
Hence, $\mathcal{D}_1 = [0, (1+\beta)/2]$ and, for $n \ge 3$, $1/2 \notin \mathcal{D}_n$.
If $\sup_{s\ge r}\sup_{\{x,|x|=s\}}\langle\nabla\log\pi(x),x\rangle =: -\varrho^{-1} < 0$ exists, then
$[0, 1/2 + \varrho(1-n/2)] \subseteq \mathcal{D}_n$.
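
The admissible temperature range for the pure power-law family can be checked numerically. The sketch below is only an illustration under the assumption $\pi(x) = c|x|^{-1/\beta}$ for large $|x|$, for which $\langle\nabla\log\pi(x),x\rangle = -1/\beta$, so that the integrand of (18) is proportional to a power of $t$. The function names and the numerical tolerance are hypothetical.

```python
def max_temperature(beta, n):
    """Upper endpoint (1 + beta*(2 - n))/2 of D_n when pi(x) = c|x|^(-1/beta)."""
    if not (0.0 < beta < 1.0 / n):
        raise ValueError("need 0 < beta < 1/n")
    return 0.5 * (1.0 + beta * (2 - n))

def radial_exponent(beta, n, d):
    # For the power-law family, <grad log pi(x), x> = -1/beta, so the
    # integrand of (18) is proportional to t**(1 - n + (1 - 2d)/beta).
    return 1.0 - n + (1.0 - 2.0 * d) / beta

def is_regular(beta, n, d, tol=1e-12):
    # The integral of t**e over [r, +inf) diverges iff e >= -1.
    # The tolerance guards against rounding at the boundary value of d.
    return d >= 0.0 and radial_exponent(beta, n, d) >= -1.0 - tol
```

For instance, with $\beta = 1/4$ and $n = 3$ the upper endpoint is $3/8$, and the temperature $d = 1/2$ is excluded, in line with the remark that $1/2 \notin \mathcal{D}_n$ for $n \ge 3$.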

It is readily seen that setting $V = 1 + \mathrm{sign}(p)\,\pi^{-p}$ outside a compact set,
and $V = 1$ otherwise,

(21) $LV = -\frac{|p|}{2}\,\pi^{2(\beta-d)-p}\Big(\frac{|\nabla\log\pi|}{\pi^{\beta}}\Big)^{2}\Big(1 - p - 2d + \frac{\mathrm{Tr}(\nabla^2\log\pi)}{|\nabla\log\pi|^2}\Big)$

for large $|x|$. As established in [32], Theorem 3.1, the diffusion cannot be
geometrically ergodic when $0 \le d < \beta$: by choosing $f := \pi^{d-\beta}$ and applying
Itô's formula, $df(X_t) \sim c_1\pi^{\beta-d}(X_t)\,dt + c_2\,dW_t$ for some constants $c_i$, and the
drift coefficient tends to zero for large values of the process. The process
$(f(X_t))_t$ fails to be geometrically ergodic, and hence so does $(X_t)_t$ itself.
From (21), for large $|x|$,



$LV \le -cV^{1-\alpha},$

where $\alpha := 2p^{-1}(\beta - d)$, and $c > 0$ provided that $1 + \gamma - p - 2d > 0$.


In any case, one has to choose $p$ such that $c > 0$. If $\alpha \le 0$ and $p > 0$, then
the process is geometrically $V$-ergodic ([23], Theorem 6.1). If $0 < \alpha < 1$ and
$p > 0$, the diffusion is polynomially ergodic as discussed in Section 2. If $\alpha < 1$
and $p$ can be set negative, the process is uniformly ergodic, that is, there exist
$\kappa > 1$ and a constant $c$ such that for all $x$, $\lim_{t\to\infty}\kappa^t\|P^t(x,\cdot) - \pi(\cdot)\|_{\mathrm{TV}} \le c$,
and the convergence does not depend on the starting point. This yields
Theorem 16: the first assertion results from [28] and Corollary 6 of the
present paper. The second and third assertions result from [23], Theorem 6.1.
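
The case distinction above can be summarized in a few lines of code. The sketch below is an illustration, not part of the paper: it assumes a target in the class A4 with parameters $\beta$ and $\gamma$, takes $\alpha = 2(\beta - d)/p$ as in the drift inequality, and reads the polynomial rate bound of Theorem 16(i) as $\tau < (1+\gamma-2\beta-\kappa)/(2(\beta-d))$. The function names are hypothetical, and the regularity constraint on $d$ is not checked.

```python
def alpha_exponent(beta, d, p):
    # alpha = 2(beta - d)/p in the drift inequality LV <= -c V^(1 - alpha).
    return 2.0 * (beta - d) / p

def regime(beta, d):
    """Regime suggested by the sign of beta - d (class A4 assumed)."""
    if d < 0.0:
        raise ValueError("d must be nonnegative")
    # d < beta: alpha > 0 for p > 0, polynomial rates only.
    # d >= beta: alpha <= 0 for p > 0, geometric rates.
    return "polynomial" if d < beta else "geometric"

def rate_bound(beta, gamma, d, kappa):
    """Supremum of the exponents tau allowed for the (1 + pi^(-kappa))-norm."""
    if not (0.0 <= d < beta):
        raise ValueError("polynomial regime requires 0 <= d < beta")
    if not (0.0 <= kappa < 1.0 + gamma - 2.0 * beta):
        raise ValueError("need 0 <= kappa < 1 + gamma - 2*beta")
    return (1.0 + gamma - 2.0 * beta - kappa) / (2.0 * (beta - d))
```

With $n = 1$, $\beta = 1/2$ and $\gamma = \beta(2-n) = 1/2$, the bound in total variation ($\kappa = 0$) is $1/2$ for $d = 0$ and increases to $1$ for $d = 1/4$, which illustrates that a larger temperature gives a better polynomial rate.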

Theorem 16. Consider the Langevin tempered diffusion on $\mathbb{R}^n$, where
the target density $\pi$ is from the class A4 and $\sigma := \pi^{-d}$ for some $d$ satisfying
(17) if $n = 1$ or (18) if $n \ge 2$.

(i) If $0 \le d < \beta$, the process fails to be geometrically ergodic. For all
$0 \le \kappa < 1 + \gamma - 2\beta$,

(22) $\lim_{t\to+\infty}(t+1)^{\tau}\|P^t(x,\cdot) - \pi(\cdot)\|_{1+\pi^{-\kappa}} = 0, \qquad \tau < \frac{1+\gamma-2\beta-\kappa}{2(\beta-d)}.$

(ii) If $\beta \le d < (1+\gamma)/2$, then for all $0 < \kappa < 1 + \gamma - 2d$, the diffusion is
geometrically $V$-ergodic with $V := 1 + \pi^{-\kappa}$.

(iii) If $\beta < d$, the diffusion is uniformly ergodic.

Theorem 16 extends earlier results to the multidimensional case and provides
polynomial rates of convergence of the "cold" Langevin tempered diffusions
for a wide family of norms. In the one-dimensional case, when $d = 0$,
[9] (Result R3, page 245) only claims that the convergence in total variation
norm is polynomial, with no explicit value of the rate of convergence.
We establish that, for a given $\pi^{-\kappa}$-norm, the minimal rate of convergence
is achieved with $d = 0$ and, in that case, coincides with the rate of convergence
of the symmetric random walk Hastings-Metropolis algorithm with
light proposal distribution [13]. Choosing a diffusion matrix which is
heavy where the target distribution is light, and conversely, improves the
rate of convergence, as evidenced by (22). The critical temperature is $d = \beta$.
For $d \ge \beta$, the diffusion is no longer polynomially ergodic and geometric rates
can be reached. This critical temperature coincides with the one given
in [32], Theorem 3.1, for the real-valued diffusion.

Remark (General diffusions on $\mathbb{R}^n$). The techniques above can be adapted
to the analysis of diffusions satisfying (15). Under conditions implying (a)
the existence of a solution, (b) the condition (i) of Corollary 6 and (c) the
petiteness property of the compact sets (see, e.g., [12, 19, 36]), we are able
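to prove that when there exist $M, \beta, \gamma > 0$ and $l < 2$ such that

$\sup_{\{x,|x|\ge M\}}|x|^{-(2+l)}\langle x, a(x)x\rangle =: \beta, \qquad \sup_{\{x,|x|\ge M\}}|x|^{-l}\,\mathrm{Tr}(a(x)) =: \gamma,$

$\sup_{\{x,|x|\ge M\}}|x|^{-l}\langle b(x), x\rangle =: -r \quad \text{for some } r > (\gamma - \beta l)/2,$

then the diffusion is polynomially ergodic and for all $x$, for all $0 \le \kappa < l + \beta^{-1}(2r - \gamma)$,

$\lim_{t\to+\infty}(1+t)^{\tau}\|P^t(x,\cdot) - \pi(\cdot)\|_{1+|x|^{\kappa}} = 0, \qquad \tau < \frac{l + \beta^{-1}(2r-\gamma) - \kappa}{2 - l}.$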

3.3. Compound Poisson-process driven Ornstein-Uhlenbeck processes. Let
$X$ be an Ornstein-Uhlenbeck process driven by a finite rate subordinator:

$dX_t = -\mu X_t\,dt + dZ_t,$

where $Z_t := \sum_{i=1}^{N_t} W_i$, $\{W_i\}_{i\ge1}$ is an independent and identically distributed
collection of random variables from probability measure $F$ and $\{N_t\}_{t\ge0}$ is a
Poisson process of finite rate $\lambda$, independent of the collection $\{W_i\}_{i\ge1}$. Such
processes are used as storage models (see, e.g., [18]) and have recently been
used in financial econometrics as models for stochastic volatility (see [1]).
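
Between jumps the process decays deterministically, so a path can be evaluated exactly once the jump times and sizes are known. The following sketch illustrates this; the helper names are hypothetical, and the jump sizes are supplied by the caller because no particular jump law $F$ is assumed.

```python
import math
import random

def ou_value(x0, mu, jump_times, jump_sizes, t):
    """X_t for dX = -mu X dt + dZ, given the jump times T_i and sizes W_i:
    X_t = x0 e^{-mu t} + sum over T_i <= t of W_i e^{-mu (t - T_i)}."""
    x = x0 * math.exp(-mu * t)
    for jump_time, size in zip(jump_times, jump_sizes):
        if jump_time <= t:
            x += size * math.exp(-mu * (t - jump_time))
    return x

def sample_jump_times(lam, horizon, rng):
    # Poisson process of rate lam on [0, horizon], built from
    # independent exponential inter-arrival times.
    times = []
    t = rng.expovariate(lam)
    while t <= horizon:
        times.append(t)
        t += rng.expovariate(lam)
    return times
```

Without jumps, a path started at $X_0 = 2$ reaches level $1$ exactly at time $\log 2/\mu$, which is the time scale that appears in the proof of Lemma 17.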

The exponential decay of $X$, except at jump points, leads to geometric
ergodicity of $X$ when the tails of $F(\cdot)$ are sufficiently light. Here we shall
explore the case where $F(\cdot)$ is extremely heavy-tailed. First we make this
concept precise: we say a probability measure is heavy-tailed if, under that
probability measure, for all $\kappa > 0$, $\mathbb{E}[e^{\kappa X}] = \infty$. Now let $G$ denote the law of
the log jump sizes, that is, $G(A) = F(e^{A})$. We have the following negative
result showing that, for sufficiently heavy-tailed jumps, geometric ergodicity
and even ergodicity can fail. As usual, we let $\pi$ denote the invariant
probability measure (should it exist).

Lemma 17. (i) Suppose $\int xG(dx) = \infty$; then $X$ fails to be positive
recurrent.

(ii) Suppose $G$ is heavy-tailed; then $X$ fails to be geometrically ergodic.

Proof. Suppose $X_0 = 2$ and consider the petite set $C = [0,1]$. Then

(23) $\mathbb{P}(\tau_C > t) \ge \mathbb{P}[\text{jump of size} > e^{\mu t} \text{ occurs before time } \log 2/\mu] = (1 - 2^{-\lambda/\mu})\int_{\mu t}^{\infty} G(x)\,dx.$

For positive recurrence, we require $\mathbb{E}(\tau_C)$ to be finite, that is, that
$\mathbb{P}[\tau_C > t]$ be integrable. However, the integrated right-hand side of (23) is
just

$\int_0^{\infty} dt \int_{\mu t}^{\infty} G(x)\,dx = \int_0^{\infty} \mu^{-1}xG(x)\,dx = \infty$

by hypothesis, so that $\mathbb{E}(\tau_C) = \infty$ too, and positive recurrence must fail,
proving (i). For (ii), we recall that for geometric ergodicity, we require that
for some $\kappa > 0$, $\mathbb{E}[e^{\kappa\tau_C}] < \infty$. (Although not necessary, we shall again assume
that $X_0 = 2$ and $C = [0,1]$.) Thus, from (23) we require that

(24) $\int_0^{\infty} e^{\kappa t}\,dt \int_{\mu t}^{\infty} G(x)\,dx = \mu\kappa^{-1}\int_0^{\infty} (e^{\kappa x} - 1)G(\mu x)\,dx < \infty.$

However, this is precluded by the heavy-tailed nature of $G$, thus proving (ii).
□

Examples of jump distributions for which geometric ergodicity fails (case (ii)
above), though we will see that $X$ is positive Harris-recurrent, include the
following:

$F(dx) = \frac{dx}{x(\log x)^{k}}$ at least for $k > 1$;

$F(dx) = \frac{e^{-(\log x)^{\beta}}\,dx}{x}$ for some $\beta < 1$.

Lemma 18. Suppose that for some $r > 1$, $m_r := \int_0^{\infty}[\log(1+u)]^{r}F(du) < \infty$.
Then, $X$ is polynomially ergodic with rate $(1+t)^{r-1}$.

Proof. For differentiable functions $V$ in the domain of $\mathcal{A}$,

$\mathcal{A}V(x) = \int_0^{\infty}(V(x+u) - V(x))\,\lambda F(du) - \mu xV'(x).$

Now set $V(x) = (\log x)^{r}$; then by direct calculation,

(25) $\mathcal{A}V^{\eta}(x) = \int_0^{\infty}\big((\log(x+u))^{r\eta} - (\log x)^{r\eta}\big)\lambda F(du) - \mu x\,\frac{r\eta(\log x)^{r\eta-1}}{x}.$

Now the finiteness of $m_r$ merely ensures the finiteness of the first term on the
right-hand side of (25). So, noting that $(\log x)^{r\eta}$ is concave beyond $x = e^{r-1}$
for all $0 < \eta \le 1$, we find that, in fact, the first term on the right-hand side
of (25) is bounded as a function of $x$, so that for some positive constant $c$,

$\mathcal{A}V^{\eta}(x) \le c - r\eta\mu(\log x)^{\eta r-1}.$

It is easy to check that all bounded sets are petite in this example, and,
therefore, the conditions for the application of Corollary 6 with $\alpha = r^{-1}$ are satisfied.
□
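
To make the moment condition of Lemma 18 concrete, consider a hypothetical jump law for which $\log(1+W)$ is Pareto distributed, $\mathbb{P}(\log(1+W) > y) = y^{-a}$ for $y \ge 1$. This family is an assumption of the sketch below and is not taken from the paper. In that case $m_r = a/(a-r)$ for $r < a$ and $m_r = \infty$ otherwise, so Lemma 18 applies for every $1 < r < a$.

```python
import math

def log_moment(a, r):
    """m_r = E[log(1 + W)^r] when P(log(1 + W) > y) = y**(-a) for y >= 1."""
    # Integral of y^r * a * y^(-a - 1) over [1, inf) is a/(a - r) when r < a.
    return a / (a - r) if r < a else math.inf

def polynomial_rate_exponent(r):
    # Lemma 18: if m_r is finite for some r > 1, the rate is (1 + t)**(r - 1).
    if r <= 1.0:
        raise ValueError("Lemma 18 requires r > 1")
    return r - 1.0
```

For $a = 3$, the moment $m_2 = 3$ is finite, which gives the rate $(1+t)$, while $m_3$ is infinite.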
4. Proofs of Section 2. When not explicitly defined, $c$ denotes a generic
finite positive constant. $\theta$ is the usual shift operator on the canonical
probability space of the strong Markov process.

Lemma 19. If $\Psi^{-1}$ is a Young function and $r \in \Lambda_0$ (resp. $\Lambda$), then
$[\Psi(r) \vee 1] \in \Lambda_0$ (resp. $\Lambda$).

Proof. Let $r \in \Lambda_0$. $\Psi^{-1}$ is a continuous, increasing and convex function,
so $\Psi$ is measurable and bounded on bounded sets ([16], Chapter 1). Furthermore,
there exists a right-continuous nondecreasing function $\phi$ such that
$\ln\Psi(r(t)) = \ln r(t) + \ln\{r(t)^{-1}\int_0^{r(t)}\phi(s)\,ds\}$, thus proving that $\ln\Psi(r(t))/t \downarrow 0$
as $t \to \infty$. This yields $[\Psi(r) \vee 1] \in \Lambda_0$. The second assertion follows easily from
the definition of $\Lambda$ and the upper bound $\sup_{t\ge1}\Psi(at)/\Psi(t) < \infty$ for all $a > 0$
([16], Chapter 1, pages 7 and 8). □

While Theorem 1 and Corollary 6 are claimed for a rate function $r \in \Lambda$,
Lemma 19 shows that they can be established for a rate $r \in \Lambda_0$, and we will
do so.

4.1. Proof of Theorem 1. Without loss of generality, we assume $\Psi_1(r_*) \ge 1$
and $\Psi_2(f_*) \ge 1$.

Lemma 20. Let $r \in \Lambda_0$ and $f \ge 1$ be a Borel function. For any closed
set $C$ such that $\sup_C G_C(\cdot, f, r; \delta) < \infty$, there exists a constant $M < \infty$ such
that for all $x \in \mathsf{X}$ and $t \ge \delta$, $G_C(x, f, r; t) \le M^{t/\delta}G_C(x, f, r; \delta)$.

Proof. The proof is on the same lines as the proof of Lemma 4.1
in [20], which addresses the case $r = 1$, and the details are omitted. Using
the property $r(s+t) \le r(s)r(t)$ ([34], Lemma 1(d)), we obtain
$M = 1 + \sup_{t\ge\delta}[r(t)/r^0(t)] \times \sup_C G_C(\cdot, f, r; \delta)$, which is finite since $\lim_{t\to\infty} r(t)/r^0(t) = 0$
(this is a consequence of [34], Lemma 1). □
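
The two properties of the rate function used in this proof can be checked numerically for a polynomial rate. The sketch below takes $r(t) = (1+t)^{q}$ with $q \ge 0$ as an illustrative example (an assumption of the sketch), with $r^0(t) := \int_0^t r(s)\,ds$.

```python
def rate(t, q):
    # Polynomial rate function r(t) = (1 + t)^q, with q >= 0.
    return (1.0 + t) ** q

def rate_integral(t, q):
    # r0(t) = integral of r over [0, t] = ((1 + t)^(q + 1) - 1)/(q + 1).
    return ((1.0 + t) ** (q + 1.0) - 1.0) / (q + 1.0)

def is_submultiplicative(q, grid):
    # Checks r(s + t) <= r(s) r(t) on a finite grid of nonnegative times.
    return all(rate(s + t, q) <= rate(s, q) * rate(t, q) + 1e-12
               for s in grid for t in grid)
```

The submultiplicativity follows from $1 + s + t \le (1+s)(1+t)$, and the ratio $r(t)/r^0(t)$ behaves like $(q+1)/(1+t)$ for large $t$, so it tends to zero.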

Proposition 21. Let $r \in \Lambda_0$ and $f \ge 1$ be a Borel function. Assume
that $X$ is $\phi$-irreducible and $\sup_C G_C(\cdot, f, r; \delta) < \infty$ for some closed petite set
$C$ and $\delta > 0$. Then $x \mapsto G_C(x, f, r; \delta)$ is finite $\psi$-almost surely for some (and then
any) maximal irreducibility measure $\psi$, and $C$ is accessible.

Proof. By [20], Proposition 3.2(ii), for all $\lambda > 0$, there exist a positive
integer $m$ and a maximal irreducibility measure $\psi$ such that
$\psi(\cdot) \le \inf_{x\in C} R_\lambda^m(x, \cdot)$, where $R_\lambda$ is the resolvent kernel $R_\lambda(x, \cdot) = \int \lambda\exp(-\lambda t)P^t(x, \cdot)\,dt$.
By Lemma 20, $R_\lambda G_C(\cdot, f, r; \delta)(x) \le cG_C(x, f, r; \delta)$, where $c$ is finite for some
convenient $\lambda$. Hence, $\psi G_C(\cdot, f, r; \delta) < \infty$, proving the first statement. This
implies that there exists an accessible set $B$ such that $\sup_{x\in B}\mathbb{E}_x[\tau_C(\delta)] \le \sup_{x\in B} G_C(x, f, r; \delta) < \infty$.
Then for $q$ large enough, $\inf_{x\in B}\mathbb{P}_x(\tau_C(\delta) \le q) > 0$
and, for any $x$, $\mathbb{E}_x[\eta_C] \ge P^n(x, B)\inf_{y\in B}\mathbb{P}_y(\tau_C(\delta) \le q) > 0$ for some $n$
depending upon $(x, B)$. Hence, $C$ is accessible. □

Proposition 22. Suppose assumptions (i) and (ii) of Theorem 1. Then:

(i) There exist $t_0$ and a measure $\nu$ such that $\inf_{t\ge t_0}\inf_{x\in C}P^t(x, \cdot) \ge \nu(\cdot)$,
and $\nu(C) > 0$.

(ii) For any set $B$ such that $\nu(B) > 0$, $\mathbb{E}_x[r^0(T_{m,B})] \le R_B\,\mathbb{E}_x[r^0(\tau_C(\delta))]$ for
some finite constant $R_B$.

(iii) For any $t \ge 0$ and any accessible set $B$, $\mathbb{E}_x[r^0(\tau_B(t))] \le R_{t,B}\,\mathbb{E}_x[r^0(\tau_C(\delta))]$
for some finite constant $R_{t,B}$.

Proof. (i) Results from R2, Proposition 21 and Lemma 2. (ii) Let $t_0$
and $\nu$ be given by (i). Set $\tau := \tau_C(t_0 + m)$, and define the sequence of iterates
$\tau^1 := \tau$ and, for $n \ge 2$, $\tau^n := \tau^{n-1} + \tau\circ\theta^{\tau^{n-1}}$. Finally, let $(u_n)_{n\ge2}$ be a $\{0,1\}$-valued
process given by $u_n = 1$ if $X_{\lceil(\tau^{n-1}+t_0)/m\rceil m} \in B$ and $u_n = 0$ otherwise; $\lceil t\rceil$
denotes the upper integer part of $t$. Then $u_n \in \mathcal{H}_n$ with $\mathcal{H}_n = \sigma(X_t, t \le \tau^n)$,
and by the strong Markov property, $\mathbb{P}_x(u_n = 1 \mid \mathcal{H}_{n-1}) \ge \nu(B) > 0$ for $n \ge 2$.
Finally, set $\eta := \inf\{n \ge 2, u_n = 1\}$, so that $\mathbb{E}_x[r^0(T_{m,B})] \le \mathbb{E}_x[r^0(\tau^{\eta})]$. Using
again the strong Markov property and the inequality $r^0(t_1 + t_2) \le r^0(t_1) + r(t_1)r^0(t_2)$ [34],

(26) $\mathbb{E}_x[r^0(\tau^{\eta})] \le \sum_{n\ge2}\mathbb{E}_x[r^0(\tau^n)\mathbf{1}_{\eta\ge n}] \le \sum_{n\ge2}\Big\{a_x(n) + \sup_C\mathbb{E}_\cdot[r^0(\tau)]\,b_x(n)\Big\},$

where, for all $n \ge 2$, $a_x(n) = \mathbb{E}_x[r^0(\tau^{n-1})\mathbf{1}_{\eta\ge n}]$ and $b_x(n) = \mathbb{E}_x[r(\tau^{n-1})\mathbf{1}_{\eta\ge n}]$.
Since, by Lemma 20, $\sup_C\mathbb{E}_\cdot[r^0(\tau)] < \infty$, there exist $0 < \rho < 1$ and a finite
constant $c$ such that

$b_x(n) \le \rho\,b_x(n-1) + c\,(1-\nu(B))^{n},$

$a_x(n) \le (1-\nu(B))\,a_x(n-1) + b_x(n-1)\sup_C\mathbb{E}_\cdot[r^0(\tau)];$

and $b_x(2) = \mathbb{E}_x[r(\tau)]$, $a_x(2) = \mathbb{E}_x[r^0(\tau)]$. The proof is on the same lines as the
proof of [25], Lemma 3.1, and is omitted for brevity. Hence, $\mathbb{E}_x[r^0(T_{m,B})] \le c(\mathbb{E}_x[r^0(\tau)] + \mathbb{E}_x[r(\tau)])$
for some $c < \infty$. The proof is concluded by applying
again Lemma 20 and the bound $\sup_{t\ge a} r(t)/r^0(t) < \infty$ for all $a > 0$ (see the
proof of Lemma 19).

(iii) $B$ is accessible and $C$ is petite, so there exist $t_0 \ge 0$ and $\gamma > 0$ such that
$\inf_{x\in C}\mathbb{P}_x(\tau_B \le t_0 + t) \ge \inf_{x\in C}\mathbb{P}_x(\tau_B \le t_0) \ge \gamma$. Set $\tau := \tau_C(t + t_0)$ and $u_n = 1$
if $X_s \in B$ for some $\tau^{n-1} \le s \le \tau^{n-1} + t + t_0$, and $u_n = 0$ otherwise. Following
the same lines as in the proof of (ii), it may be proved that there exists
$c < \infty$ such that $\mathbb{E}_x[r^0(\tau_B(t))] \le c\,\mathbb{E}_x[r^0(\tau_C(t + t_0))]$. The proof is concluded
by applying Lemma 20. □

Proposition 23. Suppose assumptions (i) and (ii) of Theorem 1. For
any $(\Psi_1, \Psi_2) \in \mathcal{I}$, $C$ is a $(\Psi_2(f_*), \Psi_1(r_*))$-regular set for the process, that
is, $\sup_C G_B(\cdot, \Psi_2(f_*), \Psi_1(r_*); t) < \infty$ for any $t \ge 0$ and any accessible set $B$.
$G_B(x, \Psi_2(f_*), \Psi_1(r_*); t) < \infty$ for all $x \in \mathcal{S}_\Psi$ and $\pi(\mathcal{S}_\Psi) = 1$.

Proof. $(\Psi_2(f_*), \Psi_1(r_*))$-regularity is a consequence of Young's inequality
(4), the $(f_*, 1)$-regularity of $C$ ([20], Proposition 4.1) and Proposition 22(iii).
For the second statement, write

$G_B(x, \Psi_2(f_*), \Psi_1(r_*); t) \le G_C(x, \Psi_2(f_*), \Psi_1(r_*); t) + \mathbb{E}_x\Big[\int_{\tau_C(t)}^{\tau_B(t)}\Psi_1(r_*(s))\Psi_2(f_*(X_s))\,ds\Big].$

The result now follows from the strong Markov property, Lemma 20 and the
inequality $\Psi_1(r_*(s+t)) \le \Psi_1(r_*(s))\Psi_1(r_*(t))$, which holds since $\Psi_1\circ r_* \in \Lambda_0$.
Finally, $\pi(\mathcal{S}_\Psi) = 1$ by Proposition 21. □



Proposition 24. Suppose assumptions (i) and (ii) of Theorem 1. The
skeleton chain $P^m$ is $\psi$-irreducible and aperiodic and possesses an accessible
petite set $A$ such that for all $(\Psi_1, \Psi_2) \in \mathcal{I}$,

(27) $\sup_{x\in A}\mathbb{E}_x\Big[\sum_{k=0}^{T_{m,A}-1}\Psi_1(r_*(km))\Psi_2(f_*(X_{km}))\Big] < \infty.$

Proof. For the definitions of accessibility, smallness, petiteness and
aperiodicity of discrete-time Markov chains, see [21]. From Proposition 22(i),
$C$ is small for the skeleton $P^m$ and the skeleton is aperiodic (Theorem 5.4.4,
[21]). In addition, by R2, the skeleton is positive and $\pi(f_*) < \infty$. Let $C_n$
be a petite set (for the skeleton $P^m$) such that $A = C \cap C_n$ is of positive
$\psi$-measure and

(28) $\sup_{x\in C_n}\mathbb{E}_x\Big[\sum_{k=0}^{T_{m,B}-1}f_*(X_{km})\Big] < \infty,$

for any accessible set $B$ (for the skeleton); the existence of such a set is a
consequence of Theorems 14.2.3 and 14.2.11 in [21] and Proposition 22(ii).
The set $A$ is accessible and petite for the skeleton. (27) now results from
Young's inequality (4), (28) and Proposition 22(ii). □
Proof of Theorem 1. By Proposition 24 and [35], Theorem 2.1 and
Proposition 3.2, $\lim_{n\to\infty}\Psi_1(r_*(n))\|P^{nm}(x, \cdot) - \pi(\cdot)\|_{\Psi_2(f_*)} = 0$ for $\pi$-a.a. $x$.
By Jensen's inequality, the upper bound $\sup_{t\ge1}\Psi(at)/\Psi(t) < \infty$ for all
$a > 0$, and assumption (iii), we have for all $t \le m$, $P^t\Psi_2(f_*) \le c\Psi_2(f_*)$. In
addition, since $\Psi_1(r_*) \in \Lambda$, $\Psi_1(r_*(n+t)) \le c\Psi_1(r_*(n))$ for all $t \le m$ ([34],
Lemma 1). Hence,

(29) $\lim_{t\to\infty}\Psi_1(r_*(t))\|P^t(x, \cdot) - \pi(\cdot)\|_{\Psi_2(f_*)} = 0, \qquad \pi\text{-a.a. } x.$

We now prove that this convergence occurs for all $x \in \mathcal{S}_\Psi$, which is of
$\pi$-measure one, by Proposition 23. To that goal, we mimic the proof of [22],
Theorem 7.2. By Egorov's theorem, there exists a set $A$, $\pi(A) > 0$, such that
(29) holds uniformly for all $x \in A$. For all Borel functions $g \in \mathcal{L}_{\Psi_2(f_*)}$, set
$\bar g := g - \pi(g)$. Since $\Psi_1(r_*) \in \Lambda$,

$\Psi_1(r_*(t))\,|\mathbb{E}_x[\bar g(X_t)\mathbf{1}_{\tau_A\le t}]| \le \Psi_1(r_*(t))\int_0^t\sup_{y\in A}|P^{t-s}\bar g(y)|\,\mathbb{P}_x(\tau_A \in ds)$

$\le M\{\mathbb{E}_x[\Psi_1(r_*(\tau_A))] + \mathbb{E}_x[\Psi_1(r_*(\tau_A))\mathbf{1}_{\tau_A\ge t/2}]\},$

where $M = \sup_{y\in A}\sup_{s\ge0}\Psi_1(r_*(s))|P^s\bar g|(y)$. Let $x \in \mathcal{S}_\Psi$; from Proposition 23,
$\mathbb{E}_x[\Psi_1(r_*(\tau_A))] < \infty$ and $\lim_{t\to\infty}\mathbb{E}_x[\Psi_1(r_*(\tau_A))\mathbf{1}_{\tau_A\ge t/2}] = 0$. Since the limit (29)
holds uniformly for all $x \in A$, $M$ is finite. Hence, $\lim_{t\to\infty}\Psi_1(r_*(t))|\mathbb{E}_x[\bar g(X_t)\mathbf{1}_{\tau_A\le t}]| = 0$
uniformly for all $g \in \mathcal{L}_{\Psi_2(f_*)}$.

Since $\pi(f_*) < \infty$, $|\mathbb{E}_x[\bar g(X_t)\mathbf{1}_{\tau_A>t}]| \le c\,\mathbb{E}_x[\Psi_2(f_*(X_t))\mathbf{1}_{\tau_A>t}]$. Following the
same lines as in the proof of [22], Theorem 7.2, using again $\sup_{u\le m}P^uf_* \le cf_*$,
we obtain

$\Psi_1(r_*(t))\,\mathbb{E}_x[\Psi_2(f_*(X_t))\mathbf{1}_{\tau_A>t}] \le c\Psi_1(r_*(m))\inf_{0\le u\le m}\Psi_1(r_*(t-u))\,\mathbb{E}_x[\Psi_2(f_*(X_{t-u}))\mathbf{1}_{\tau_A>t-u}].$

By Proposition 23, $G_A(x, \Psi_2(f_*), \Psi_1(r_*); 0) < \infty$, which implies that the upper
limit of the right-hand side is zero, proving that $\lim_{t\to\infty}\Psi_1(r_*(t))\,\mathbb{E}_x[\Psi_2(f_*(X_t))\mathbf{1}_{\tau_A>t}] = 0$.
Hence, uniformly for $g \in \mathcal{L}_{\Psi_2(f_*)}$, $\lim_{t\to\infty}\Psi_1(r_*(t))\,|\mathbb{E}_x[\bar g(X_t)\mathbf{1}_{\tau_A>t}]| = 0$.
This concludes the proof. □

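4.2. Proof of Corollary 6. Set $f_* := V^{1-\alpha}$ and $r_*(t) := (t+1)^{1/\alpha-1}$.

Lemma 25. Suppose assumption (ii) of Corollary 6. For any $\alpha \le \eta \le 1$,
$t \ge 0$, and any $\mathcal{F}_t$-stopping-time $\tau$,

$c_\eta\,\mathbb{E}_x\Big[\int_0^{\tau\wedge t}V^{\eta-\alpha}(X_s)\,ds\Big] + \mathbb{E}_x[V^{\eta}(X_{\tau\wedge t})] \le V^{\eta}(x) + b\,\mathbb{E}_x\Big[\int_0^{\tau\wedge t}\mathbf{1}_C(X_s)\,ds\Big].$

Proof. By definition of $\mathcal{A}V$,

$c_\eta\,\mathbb{E}_x\Big[\int_0^{\tau\wedge t\wedge T_n}V^{\eta-\alpha}(X_s)\,ds\Big] + \mathbb{E}_x[V^{\eta}(X_{\tau\wedge t\wedge T_n})] \le V^{\eta}(x) + b\,\mathbb{E}_x\Big[\int_0^{\tau\wedge t\wedge T_n}\mathbf{1}_C(X_s)\,ds\Big].$

The right-hand side is upper bounded by $V^{\eta}(x) + bt$ and, by the monotone
convergence theorem, it converges to $V^{\eta}(x) + b\,\mathbb{E}_x[\int_0^{\tau\wedge t}\mathbf{1}_C(X_s)\,ds]$ as $n \to \infty$.
The lemma now results from Fatou's lemma. □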

Proposition 26. Suppose assumption (ii) of Corollary 6. For all $\delta > 0$,
there exists $c < \infty$ such that for all $x \in \mathsf{X}$, $G_C(x, 1, r_*; \delta) \le cV(x)$.

Proof. Set $q := \lfloor 1/\alpha\rfloor$, where $\lfloor\cdot\rfloor$ denotes the lower integer part. By
Lemma 25, we have $\mathbb{E}_x[\tau_C] \le cV^{\alpha}(x)$ and by Jensen's inequality, we obtain
$\mathbb{E}_x[\tau_C^{1/\alpha-q}] \le cV^{1-q\alpha}(x)$. We prove by induction that for all integers $1 \le l \le q$,
$\mathbb{E}_x[\tau_C^{1/\alpha-l}] \le cV^{1-l\alpha}(x)$. The case $l = q$ holds; assume it is verified for some
$2 \le l \le q$. The induction hypothesis and Lemma 25 yield



which concludes the induction. For $l = 1$, this yields $G_C(x, 1, r_*; 0) \le cV(x)$.
Finally, by standard manipulations and Lemma 25, we have
$G_C(x, 1, r_*; \delta) \le c(1 + P^{\delta}V(x)) \le cV(x)$. □

Proof of Corollary 6. We check the conditions for the application
of Theorem 1. Lemma 25 and Proposition 26 imply $G_C(x, f_*, 1; \delta) \le cV(x)$
and $G_C(x, 1, r_*; \delta) \le cV(x)$, from which we deduce the condition (ii) of
Theorem 1, and by R1, condition (i) of Theorem 1. Condition (iii) follows from
Lemma 25. Finally, $\mathcal{S}_\Psi = \mathsf{X}$. □